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GEORGE DAVID BIRKHOFF 
1884-1944 

“Creative intellect is the finest flower of the race. It is by such,—and they 
are rare—, that the direction of our cultural evolution is determined. It is 
such as they who set our finest standards and who by their example inspire 
and hearten others in their striving for what always lies ahead,—renewed and 
ever greater accomplishments.” 

George David Birkhoff was a truly creative mathematical intellect. Where 
he set his hand to the plow, the furrow never failed to be deepened, if, in¬ 
deed, its direction was not soon materially altered. His genius guided him 
in the divining of many ways to significant and previously inaccessible re- 
su jt s —ways in which others have been content to follow him. 

Professor Birkhoff was born in the State of Michigan on March 21, 1884. 
He had his early schooling in the City of Chicago, at the Lewis Institute 
(now a part of the Illinois Institute of Technology) and at the University 
of Chicago. For his later college studies he transferred to Harvard, and there 
he also did his earlier graduate work, taking his bachelor's and master’s de¬ 
grees in 1905 and 1906. His doctorate of philosophy, for which he returned 
to the University of Chicago, was conferred upon him in 1907. The list of 
universities at which he functioned then as a regular teacher is not long. From 
1907 to 1909 he was an instructor at Wisconsin. From 1909 to 1912 he filled 
preceptorial and professorial positions at Princeton, and from then on to 
the time of his death he held professorial chairs at Harvard. As a lecturer 
or as a visiting faculty member, however, he filled places throughout the 
years at many institutions in this country and abroad. He died suddenly in 
his sixty-first year at his home in Cambridge, Massachusetts. 

Professor Birkhoff’s was a versatile mind. The American mathematicians 
under whose scientific and personal influence he seems to have especially 
fallen were Bdcher of Harvard, E. H. Moore of Chicago, and E. B. Van 
Vleck of Wisconsin. Most of all, however, he was intellectually a disciple 
of Poincar6. His mathematical publications began in 1904, while he was still 
an undergraduate, and continued throughout his life. They total roughly to 
about one hundred memoirs, and form a very substantial contribution to 
mathematics. With his doctoral work he began an important series of re¬ 
searches in the field of differential equations from which sprung important 
results in the theories of asymptotic solutions, of boundary value problems, 
of regular and irregular singular points, and of others. The field of dynamics 
occupied him throughout his life, and rewarded him with notable discoveries 
in the theories of stability, the problem of three bodies, Poincare’s geometric 
theorem, dynamical systems, and in ergodic theory. Over a course of years he 
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made important advances in the theory of both linear and g-type difference 
equations, and from the year 1922 on he wrote and lectured considerably 
upon the theory of relativity. Other fields to which he contributed by publica¬ 
tion include those of map coloring, mean-value relations, elasticity, the cal¬ 
culus of variations, modern physical theory, and aesthetic measure. 

The profundity of Professor Birkhoff’s work was universally recognized 
abroad as well as at home. In 1918 his researches in dynamics and differ¬ 
ential equations were crowned by the Querini-Stampalia prize of the Royal 
Institute of Science, Arts and Letters, of Venice. In 1923 the American 
Mathematical Society awarded him its Bdcher prize, and in 1926 he was 
awarded the annual prize by the American Association for the Advancement 
of Science. In 1927 he gave by invitation the Society’s Colloquium Lectures, 
and in 1933 he received the prize of the Pontifical Academy of Sciences. He 
was a member of the National Academy of Sciences, the American Academy 
of Arts and Sciences, and the American Philosophical Society. He was an 
“Officier de Legion d’honneur,” and a corresponding or honorary member of 
many European and Latin American scientific organizations. He was presi¬ 
dent of the American Mathematical Society in 1925, and President of the 
American Association for the Advancement of Science in 1937. Honorary 
doctorates had been conferred upon him by Harvard, Brown and the Illi¬ 
nois Institute of Technology, and by the Universities of Chicago, Wisconsin, 
Pennsylvania, Poitiers, Paris, Athens, Sophia, Lima, Santiago, Buenos Aires, 
and St. Andrews. 

Professor Birkhoff’s research papers are to be found in the mathematical 
journals of many countries, from the years 1904 to 1944. As books he pub¬ 
lished the following: Relativity and modern physics (1923); The origin, nature 
and influence of relativity (1925); Dynamical systems (1927); Aesthetic measure 
(1933). 

R. E. Langer 
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GEORGE DAVID BIRKHOFF 
(1884-1944) 

George David Birkhoff was born at Overisel, Michigan, on the 
twenty-first of March, 1884. His ancestry was Dutch on both sides. 
His father, David Birkhoff, came from Holland in 1870, and dur¬ 
ing George David’s growing years was a physician in Chicago. 
Birkhoff studied at the Lewis Institute, Chicago, from 1896 to 
1902, and at the University of Chicago for a year. After this he 
went to Harvard, where he received the Bachelor’s degree in 1905. 

Beginning in the year 1900 there appeared in the problem 
department of the American Mathematical Monthly , edited by B. 
F. Finkel, a series of notes, solutions, and problems by H. S. 
Vandiver, of Bala, Pennsylvania. In 1901 Birkhoff, who had 
doubtless found the Monthly in the old John Crerar Library, began 
exchanging letters about various questions in the theory of num¬ 
bers with Vandiver, who was then nineteen years old. This cor¬ 
respondence resulted in the publication in 1904 of their joint paper 
in the Annals of Mathematics “On the integral divisors of a n - b n .” 
So far as I know this was Birkhoff’s only publication in the theory 
of numbers, but Vandiver has told me that Birkhoff was in posses¬ 
sion in those days of at least one number-theoretical theorem 
which is now counted among the notable contributions of a distin¬ 
guished mathematician in another part of the world. In later life 
Birkhoff often showed an interest in number theory, but seems 
never to have taken the deep plunge which would have been neces¬ 
sary in order to bring up new results of the sort that would have 
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satisfied him. It was not until his Princeton period that he met 
Vandiver personally. 

During his undergraduate years he also made a definite begin¬ 
ning in analysis, as is proved by the fact that he read a paper en¬ 
titled “A general remainder theorem” before the American 
Mathematical Society in New York in February 1904 (Amcr. Math. 
Soc. Bulletin 10: 280). This was the basis of a paper entitled “Gen¬ 
eral mean value and remainder theorems with applications to me¬ 
chanical differentiation and quadrature,” published in the Transac¬ 
tions of the American Mathematical Society , volume 7 (1906). 

Birkhoff returned to the University of Chicago in the fall of 
1905 and received his Ph.D. summa cum laude in 1907 at the age 
of twenty-three. This is not an unusual age for a European doc¬ 
torate but, unfortunately for the New World, it is an exceptionally 
early one in the United States. Birkhoff’s student period had been 
divided between the only two great mathematical centers which 
existed in America at that time. From Osgood and Bocher he 
obtained a thorough introduction to the classical methods of 
analysis, and from E. H. Moore who was then at the outset of his 
adventure in “General Analysis,” a grasp of the abstract modern¬ 
istic ideas which have characterized so much of mathematics during 
the last four decades. Birkhoff reacted rather strongly against the 
latter and in favor of the former. His view was that while one 
should understand the analogies between the linear problems of 
analysis and those of classical geometry and algebra, his attention 
should be concentrated on strategically important specific prob¬ 
lems of the classical type. 

His doctoral dissertation on asymptotic problems of ordinary 
linear differential equations does in fact continue the tradition to 
which Bocher belonged. But it also uses the powerful methods of 
the Fredholm theory of integral equations and the broad general 
ideas which E. H. Moore was trying to exploit. It initiated a series 
of studies by which he left his mark on most of the principal 
branches of the theory of linear differential equations: regular 
and irregular singular points, expansion and boundary value prob¬ 
lems, separation theorems, and his generalization of the Riemann 
problem. With these researches it seems reasonable to group his 
work on matrices of analytic functions and his remarkable contri¬ 
butions to the theory of linear difference equations, as constituting 
one of the three principal periods of Birkhoff’s scientific activity. 
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In time, this period overlaps his whole career, but his most intense 
effort in these fields belongs to his earlier years. 

After receiving his doctorate in 1907, Birkhoff spent two years 
in Madison as an instructor in the University of W isconsin. Here 
lie learned more analysis from E. B. Van Vleck, and in particular 
had his attention directed toward linear difference equations. This 
period also includes his marriage in 1908 to Miss Margaret Eliza¬ 
beth Grafius, a union of mutual devotion and helpfulness which 
lasted throughout the rest of his life. There were three children, 
Barbara (Mrs. Robert Treat Paine, Jr.), Garrett, and Rodney. 
Garrett has already gained distinction as a mathematician of quite 
different tendencies from those of his father. 

In 1909 he came to Princeton University as a preceptor and 
was promoted to a professorship on the occasion of a call to Har¬ 
vard in 1911. At Princeton during this period a third significant 
current of American mathematical thought, a geometrical one. 
was gathering force. Birkhoff shared in the exploratory studies 
then being made of analysis situs, as it was called before being 
formalized into “topology,” and saw their close relation to the 
class of dynamical problems which were at this time taking definite 
form in his mind. Incidentally, he had more than one try at the 
four-color map problem, to solve which remained throughout life 
one of his dearest aspirations. 

In 1912 Birkhoff reconsidered the question of returning to 
Harvard, and accepted an assistant professorship in that uni¬ 
versity, in which rank he remained for another seven years. lie 
thus returned to the most stable academic environment then avail¬ 
able in this country, and settled into a long period of creative 
work undisturbed by the necessity, common in American univer¬ 
sities of this epoch, to build an environment in which scientific 
work can bear fruit. The final transition to Harvard was recog¬ 
nized by Birkhoff himself and his most intimate friends as mark¬ 
ing the end of the formative period of his career. I remember in 
particular a delightful letter which he received from E. II. Moore, 
ending with the words written out in bold characters, AVE 
ATQUE VALE. 

As remarked by Marston Morse, however, “Poincare was Birk¬ 
hoff ’s true teacher.” I remember well how frequently, in the 
walks which we used to take together during his sojourn in Prince¬ 
ton, Birkhoff used to refer to his reading in Poincare’s Les Meth- 
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odes Nouvelles de la Mecarique Celeste, and I know that he was 
intensively studying all of Poincare’s work on dynamics. In a very 
literal sense Birkhoff took up the leadership in this field at the 
point where Poincare laid it down. 

PoincarS died in 1912 and his last paper reached Princeton in 
the summer of that year. In it Poincare showed that the existence 
of periodic solutions of the restricted problem of three bodies can 
be deduced from a very simple-sounding geometric theorem. But 
he had not been able to prove the theorem except in special cases, 
and he felt that at his age (he was only fifty-eight when he died) 
he could not be sure of being able to return to it again, as he 
should have liked to do, after letting his ideas lie fallow for a 
while. Before the year was over Birkhoff had given a simple but 
profound proof of “Poincare’s Geometric Theorem.” The pub¬ 
lication of this proof in the Transactions of the American Math¬ 
ematical Society for January 1913 brought immediate and world¬ 
wide fame to its author, an acclaim which, for once, was justified 
by subsequent events. 

His researches in dynamics constitute the middle period of 
Birkhoff’s scientific career, that of maturity and greatest power. 
Their chief characteristics can be seen already in his first publica¬ 
tion, “Quelques theordmes sur le mouvement des systemes dynami- 
ques,” which appeared in the Bulletin de la Societe MathSmatiqxie 
de France in 1912. In this paper after a careful examination of 
the properties of stable motion, Birkhoff introduced his concept of 
“recurrent motion” which has played a role alongside the classical 
concept of periodic motion in all further discussions of the descrip¬ 
tive properties of dynamical trajectories. It is, for example, the 
starting point of the “symbolic dynamics” of Morse and Hedlund. 
While Poincare had made good use of topology in the theory of 
dynamical systems, it was Birkhoff’s merit to have powerfully 
supplemented this by the use of the Lebesgue measure theory. In 
the unfolding of the geometric picture of the general case in dy¬ 
namics, one of the significant stages was the introduction of the 
concept of “metric transitivity” which appeared for the first time 
in his joint paper with Paul Smith on “Structure analysis of sur¬ 
face transformations” in LiouvilWs Journal (1928) where it was 
applied to two-dimensional problems of a class more general than 
those of dynamics. This line of thought reached its climax in the 
winter of 1931-1932 when under the stimulus of closely related 
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discoveries by Koopman and von Neumann he succeeded in proving 
his justly famous “ergodic theorem.” Birkhoff’s ergodic 
theorem, though it does not completely solve the basic problem of 
statistical mechanics at which it is aimed, has reduced that prob¬ 
lem to a definite question about metric transitivity, and is also a 
milestone in the progress of measure theory. Birkhoff’s proof, 
which, characteristically, used the rough and ready tools picked 
up along the path which led him to it, has been replaced by simpler 
and more sophisticated methods, and there has grown up a rather 
extensive literature of “ergodic theory.** 

Most of Birkhoff’s publications in dynamics are devoted to 
dynamical systems of two or three degrees of freedom. Here he 
employed Poincare’s concept of a “surface of section” and the 
transformations in it determined by a family of dynamical tra¬ 
jectories. Poincare’s geometric theorem is a case in point. He also 
carried the use of the representation of trajectories by means of 
geodesics on surfaces considerably beyond the stage reached by 
Poincar6 and Hadamard. His “miniraax principle’* was the start¬ 
ing point of Morse’s “Analysis in the Large” which has done so 
much to make topology effective in analysis. 

Although Birkhoff’s most notable successes were in the geomet¬ 
rical aspects of dynamics, he did not neglect, nor was he deficient 
in power over the analytic formalism. He achieved as good a view 
of the whole field of theoretical dynamics as did anyone in his 
time. For more authoritative accounts and evaluations of Birk¬ 
hoff’s work both in this field and in what I have called his first 
period, I should like to refer the reader to the notices by E. T. 
Whittaker in the Journal of the London Mathematical Society, 
volume 20 (1945), and by Marston Morse in the Bulletin of the 
American Mathematical Society , volume 52 (1946). In addition, 
there are many interesting comments on his own work and revela¬ 
tions of his point of view toward that of his contemporaries, in 
Birkhoff’s address on “Fifty years of American mathematics” 
which was published in 1938 in a volume celebrating the semicen¬ 
tennial of the American Mathematical Society. 

The third phase of Birkhoff’s scientific career was that in which 
he sought to extend mathematical methods into other fields of 
thought,—physics, aesthetics, and even ethics. He was already 
speculating on the possibility of a mathematical theory of music, 
and indeed of art in general, while he was in Princeton. But he 
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did not give these ideas to the world until 1928 when he delivered 
one of the principal addresses of an international mathematical 
congress under the title, “Quelques elements mathematiques de 
l'art, 1 ' in the Salone dei Cinquecento of the Palazzo Vecchio at 
Florence. Later on, after much reflection and a trip around the 
world, he published his book Aesthetic Measure, in 1933. In 1942, 
“A mathematical approach to ethics” appeared in the Rice Insti¬ 
tute Pamphlets (vol. 28). These studies, though Birkhoff took them 
quite seriously, seem to me to be definitely less likely than his 
purely mathematical work to survive. 

Something similar, I think, must be said about his efforts in 
physics. Like Goethe and Hilbert, he always remained an outsider. 
It may have been that the very strength of his faith in mathemat¬ 
ical insight prevented him from properly appreciating the insight 
of the physicists. His active interest in physics seems to have be¬ 
gun with a course in relativity which he gave in the winter of 
1921-1922, and it continued increasingly up to the time of his 
death when lie was engaged in exploiting a gravitational theory of 
his own. As his contribution to physics there remain some unques¬ 
tionable improvements in mathematical technique, some criticisms 
of present tendencies, and a physical theory which can survive only 
if it passes the tests both of experiment and assimilability into 
the growing body of science. 

Among the unconscious revelations of the address on ‘‘Fifty 
years of American mathematics,” one of the most vivid is that of 
the depth and sincerity of Birkhoff’s devotion to the cause of math¬ 
ematics, and particularly of ‘‘American mathematics.” This, 
along with his devotion to Harvard, was always a primary motive. 
It may be added that a sort of religious devotion to American 
mathematics as a ‘‘cause” was characteristic of a good many of 
his predecessors and contemporaries. It undoubtedly helped the 
growth of the science during this period. By now, mathematics 
is perhaps strong enough in the United States to be less national¬ 
istic. The American mathematical community has at least been 
healthy enough to absorb a pretty substantial number of European 
mathematicians without serious indigestion. 

Birkhoff was always on the lookout for talent among the young 
mathematical aspirants who came to Harvard. I recently looked 
over some of his letters and found them full of comments on the 
young men for whom he had hopes. Some of the names I had 
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forgotten, but many of the comments are still enjoyable. His 
capacity for intelligent study of the qualifications and needs of 
younger mathematicians was used for the benefit of science on a 
much wider stage during the years (1925 to 1937) that Birkhoff, 
Bliss, and I were the mathematical members of the National Re¬ 
search Fellowship Board. I am sure that Bliss will agree with me 
about Birkhoff’s remarkable capacity for picking “the good ones” 
and guessing what they needed. While Birkhoff was subject to as 
many prejudices as most of us, he kept always what most of us 
lose as we grow older, the power to see people and events simply 
and naively rather than with reference to current opinion. 

Birkhoff unhesitatingly accepted the public responsibilities that 
came his way. He served as Dean of the Faculty of Arts and Sci¬ 
ences at Harvard from 1937 to 1939. He carried his share of mili¬ 
tary research work during both World Wars. He traveled exten¬ 
sively and accepted a large number of invitations to lecture, both 
those of an honorific sort and those that simply afforded an 
opportunity to extend mathematical culture into new areas. He 
did much of the unrewarded administrative work of the American 
Mathematical Society. For example, he served on the committee 
which, after a lively debate, decided to undertake the publication 
of Mathematical Reviews. After the main issues had been decided 
against his judgment, he cooperated loyally and actively in the 
working out of details. 

It is pleasant to record that Birkhoff received nearly all the 
distinctions, such as honorary degrees and elections to societies and 
academies, that can come to a mathematician, and received many 
of them at an unusually early age. He became a member of the 
American Philosophical Society in 1921 and was a frequent at¬ 
tendant at its meetings. 

During the last few years of his life Birkhoff knew that his 
heart was no longer as strong as it had been, but he never slackened 
up his scientific and other work. He died in his sleep on 
November 12, 1944. 

Oswald Veblen 




Reprinted from Bull. Amer. Math. Soc. t May, 1946, Vol. 52, No. 5, 
Part 1, pp- 357-391. 
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The writer first saw Birkhoff in the fall of 1914. The graduate stu¬ 
dents were meeting the professors of mathematics of Harvard in 
Sever 20. Maxime Bdcher, with his square beard and squarer shoes, 
was presiding. In the back of the room, with a different beard but 
equal dignity, William Fogg Osgood was counseling a student. Dun¬ 
ham Jackson, Gabriel Green, Julian Coolidge and Charles Bouton 
were in the business of being helpful. The thirty-year-old Birkhoff 
was in the front row. He seemed tall even when seated, and a friendly 
smile disarmed a determined face. I had no reason to speak to him, 
but the impression he made upon me could not be easily forgotten. 

His change from Princeton University to Harvard in 1912 was de¬ 
cisive. Although he later had magnificent opportunities to serve as a 
research professor in institutions other than Harvard he elected to 
remain in Cambridge for life. He had been an instructor at Wisconsin 
from 1907 to 1909 and had profited from his contacts with Van Vleck. 
As a graduate student in Chicago he had known Veblen and he con¬ 
tinued this friendsip in the halls of Princeton. Starting college in 
1902 at the University of Chicago, he changed to Harvard, remained 
long enough to get an A.B. degree, and then hurried back to Chicago, 
where he finished his graduate work in 1907. 

It was in 1908 that he married Margaret Elizabeth Grafius. It 
was clear that Birkhoff depended from the beginning to the end on 
her deep understanding and encouragement. Her varied talents and 
charm were reflected in the overflowing hospitality of their home. 
Their children, Garrett and Barbara (Mrs. Robert Treat Paine, Jr.) 
are well known to friends of Birkhoff. 

Birkhoff admired Moore of Chicago, but not to the point of imitat¬ 
ing him. He respected Bdcher no less, and did him the honor next to 
Poincare of following his mathematical interests. F. R. Moulton's 
study of the work of Poincard had something to do with Birkhoff’s 
own intense reading of Poincard. Poincard was Birkhoff’s true teacher. 
There is probably no mathematician alive who has explored the works 
of Poincard in full unless it be Hadamard, but in the domains of 
analysis Birkhoff wholeheartedly took over the techniques and prob¬ 
lems of Poincare and carried on. 

Birkhoff’s loyalty to Harvard was complete and only occasionally 
critical. The response of the non-mathematical members of the faculty 
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to his early successes was slow at first and he felt this. But he did not 
wait long before he was recognized by the University as a whole. It 
is a mark of the intellectual distinction of President Lowell that he 
hailed Birkhoff as one of Harvard’s greatest men and honored him 
accordingly. He was made Perkins Professor of Mathematics and 
served at one time as Dean of the Faculty of Arts and Sciences. 
Birkhoff’s colleagues in mathematics at Harvard were as one in their 
admiration of him and affection for him. 

Unlike his colleague Osgood, Birkhoff thought of his contempo¬ 
raries in Europe, particularly in Germany, as colleagues rather than 
as teachers. He held Klein lightly, was unenthusiastic over Weier- 
strass, but gave his full respect to Riemann. Through his papers on 
non-self-adjoint boundary value problems and asymptotic representa¬ 
tions he probably influenced the Hilbert integral equation school as 
much or as little as it influenced him. His relations with the members 
of the French and Italian schools of analysis were close, both per¬ 
sonally and scientifically. Levi-Civita and Hadamard were among his 
best friends. Birkhoff was at the same time internationally minded and 
pro-American. The sturdy individualism of Dickson, E. H. Moore and 
Birkhoff was representative of American mathematics “coming of age.” 
The work of these great Americans sometimes lacked external sophis¬ 
tication, but it more than made up for this in penetration, power and 
originality, and justified Birkhoff’s appreciation of his countrymen. 

Much has been said about Birkhoff's teaching, and each of his pu¬ 
pils will have his tale. There is one thing which may be noted in gen¬ 
eral: that there is an excellent correlation between the more distinr 
guished Ph.D.’s from Harvard and those who were inspired by Birk¬ 
hoff’s teaching. Among his “tutees” there was an unanalyzed appreci¬ 
ation of Birkhoff’s ability to transform a subject into something alive, 
and a deep gratitude for his painstaking personal effort with them. 
He never consciously descended to the level of the untutored, nor 
gave them evidence that he was aware of his own relative elevation. 
He often improvised in an exposition, thereby illuminating many 
things besides the immediate theorem. He once said that if his liveli¬ 
hood depended upon it he would vie with those most perfect in ex¬ 
position. Those who knew him best will agree that he could have done 
so and successfully. This demonstration however was never de¬ 
manded. 

In his social and political views, he was detached from the world 
about him and this was sometimes a source of misunderstanding. 
Proceeding from the dynamic individualism of the Middle West to 
an environment of tenacious self-sufficiency in New England, he un- 
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derstood and interpreted both as conditions for living. His philosophy 
of life was intellectual rather than emotional. His own conduct of 
life was extraordinarily well integrated, and free from manifestations 
of eccentricity. He showed his individualism most clearly in the broad 
range of suppositions which he could entertain and on which he could 
vividly speculate. The fact that these speculations were disassociated 
from his own life was one of the reasons why his extreme detachment 
was possible. He chose to be brief and simple, but not explanatory. 
He could examine a point of view as if it were his very own and later 
reflect with equal appreciation on an opposite point of view. A hearer 
who did not understand this range of his speculations could easily 
receive incorrect impressions of Birkhoff’s views. To understand 
Birkhoff one had to weigh and average his conclusions over a long 
range of circumstances. Birkhoff’s separation of thought and action 
was a part of his conscious desire to fight only one battle, the battle 
of mathematics. 

Birkhoff was uncompromising in his appraisal of mathematics— 
by the test of originality and relevance. For him the systematic or¬ 
ganization or exposition of a mathematical theory was always secondi 
ary in importance to its discovery. I recall his remarks on a mathe¬ 
matical treatise that had come to his attention and eventually had 
a wide circulation but which he did not regard as original. Birkhoff 
said M I read this book through in a half hour.” His choice of topics 
of investigation could be called objective rather than subjective. 
His problems were not necessarily chosen from among those which 
he could solve; indeed many of his papers from 1920 on were on prob¬ 
lems which he left unsolved. Some of the current mathematical theo¬ 
ries were regarded by Birkhoff as no more than relatively obvious 
elaborations of concrete examples. 

The major interest in Birkhoff in the future will undoubtedly center 
about his mathematical theories and this report is devoted to their 
exposition in brief. 1 

Popular opinion focuses attention on two striking achievements, 
his proof of Poincare’s Last Theorem and the Ergodic Theorem. 
By contrast much of his other work is obscured. Important as they 
are, it is to be doubted whether either of the above theorems is more 
substantial than other phases of his work. His magnificent theory of 
the generalized Riemann problem, his contributions to difference 
equations, his formal theory of stability in dynamics, and the con- 


1 In reviewing the research of Birkhoff the writer has had the assistance of Dr. 
Pesi Masani, particularly in the study of Birkhoff’s early work. 
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sequent geometrical theory, all involve structures upon which much 
can and probably will be built. 

In the domain of transitivity in which the Ergodic Theorem lies, 
the basic hypothesis of metric transitivity remains a conjecture as 
far as analytic dynamical systems of classical type are concerned. 
The relative importance in mathematics of statistical or symbolic 
dynamics can hardly be assessed until it is known whether or not 
topological transitivity in an analytic system implies metric trans¬ 
itivity. The importance which Birkhoff attached to symbolism in 
dynamics is made evident by the fact that his last papers on dynamics 
were largely concerned with the search for a general symbolism char¬ 
acterizing a dynamical system. 

Birkhoff’s first paper [l ] 8 was written with H. S. Vandiver under 
the title On the integral divisors of a n — b n . Birkhoff was eighteen and 
Vandiver twenty. An immediate consequence of the principal theo¬ 
rem was that there are an infinity of primes congruent to 1 , mod n, a 
result known to Kronecker, Hilbert and Sylvester. The principal theo¬ 
rem is new. One consequence of this collaboration was the high regard 
for Vandiver’s work which Birkhoff held throughout his life. 

To continue with a systematic review of Birkhoff’s work it will be 
convenient to divide his papers into six principal groups. Various sub¬ 
jects such as the four-color problem will be referred to separately. 

I. Asymptotic expansions, boundary value problems, 
Sturm-Liouville theorems 

(a) Asymptotic expansions and boundary value problems. The years 
of Birkhoff’s graduate study were major years in the development of 
integral equations. Parallel developments in terms of Green’s func¬ 
tions were of great interest. Self-adjoint problems corresponding to 
symmetric kernels had been extensively studied, but when n> 2 the 
case of a non-self-adjoint operator 

L (*) = — + • + PM + * # • + PMz (a £ x £ b) 

had not been adequately treated. 

The n boundary conditions in [3] 

d-l) w<(u) = 0 («- 1,'" , ») 

were linear and homogeneous in u and its first n —1 derivatives taken 
at x = a and x = b, and were linearly independent. The functions Pi(x) 

1 Numbers in brackets refer to the Bibliography at the end of this paper. 
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with all their derivatives were assumed continuous on the closed in¬ 
terval (a, b). The operator M(z) adjoint to L(z) was defined in the 
classical way. With L(u) and the conditions (1.1) Birkhoff was the 
first* to associate the adjoint boundary conditions 

VM = 0 (i = 1, • • • , n) 


in the now accepted manner. The parameter X was introduced, giving 
the “problem” 


(1.2) 

L(u) + Xu = 0, 

wm - 0 

(i = 1, • • • , n) 

and the adjoint problem, 



(1.3) 

M(v) + Xt. = 0, 

K,(r) = 0 

(* - !.•••» «)• 


In a paper [2] which constituted the first part of his thesis, Birk¬ 
hoff had given an elegant asymptotic representation of solutions of 
differential equations of the form 

d n z d*-'z 

(1.4) — + i(x, p) — n _ i + ••'•+ P n ao(x, p)z = 0 

for large values of |p|. The differential equations in (1.2) and (1.3) 
took the form (1.4) on setting X = p\ With the aid of these asymptotic 
representations Birkhoff proved the existence of an infinite sequence 
Xi, Xi, • .* • of characteristic values with corresponding solutions 

Ui(x), u t (x), • • , 

»*(*), • • • 

of (1.2) and (1.3) respectively. He was able to do this by imposing 
certain determinant conditions on the coefficients in the boundary 
conditions. 

The function f(x ) to be expanded was assumed real and piecewise 
continuous with a continuous derivative. In the case in which the X, 
are zeros of the first order of the determinant A(X) defining the char¬ 
acteristic roots, the expansion 

0 - 5 ) jtciUi(x) 

in which 

c = f*f( x )Vt( x )dx 
fiui(x)Vi(x)dx 

a As stated by Bdcher. Applications and generalization of the conception of adjoint 
systems, Trans. Amer. Math. Soc. vol. 14 (1913) pp. 403-120. 
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converges (as does a Fourier expansion) to 

/(* + Q)+/(*-Q) 

2 

at each inner point of (a, b), to 

* a x f(a + 0) + a 2 f(b - 0) 

at x = a and to 

bif(a + 0) + b 2 f(b - 0) 

at x=b where a x , a 2 , b lt b 2 are constants, independent of /(*). In case 
X,- is not a zero of the first order of A(X) the corresponding term in 
(1.5) is to be replaced by a term readily defined with the aid of the 
Green’s function. The proof of the convergence depended, as with 
Poincar6, on the method of contour integration in the X-plane. 

The interest in this work of Birkhoff is signaled'by a historic inter¬ 
change between Birkhoff and J. D. Tamarkin. Tamarkin had inde¬ 
pendently worked out asymptotic representations similar to those of 
Birkhoff without publishing them, and was prepared to treat the 
problem of Birkhoff when he received reprints from Birkhoff. Birk¬ 
hoff's papers were published in 1907 and 1908 respectively and 
Tamarkin’s paper 4 was published in 1912. In this paper Tamarkin 
stated that he found Birkhoff’s reasoning brief, and on certain points 
insufficient. It should be stated that Birkhoff had treated the case 
of an even n in detail and had indicated the proofs when n was odd. 
In twelve pages in [6] Birkhoff showed that his earlier proofs could 
be carried through in general. In a note 6 in the Rendiconti, Tamarkin 
reviewed the matter at issue and accepted Birkhoff’s supplementary 
proof. 

Problems similar to those treated in [2] and [3] were taken up at 
length in Birkhoff’s and Langer’s paper [8 J where the case of a sys¬ 
tem of ordinary differential equations was treated. In this paper may 
be found references to extensions of the theory by Hopkins and 
Wilder, pupils of Birkhoff. 

(b) Separation and comparison theorems. The existence of charac¬ 
teristic roots in boundary value problems in the plane and the oscilla¬ 
tion properties of characteristic functions are treated in [4] follow- 

4 J- D- Tamarkin, Sur quelques pointes de la theorie des equations differ entitles 
lintaires ordinaries et sur la generalization de la serie de Fourier , Rend. Circ. Mat. 
Palermo vol. 24 (1912) pp. 345-382. 

* J- D* Tamarkin, Sur un probUme de la theorie des equations differcnliellcs lineaircs 
ordinaires, Rend. Circ. Mat. Palermo vol. 37 (1914) pp. 376-378. 
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ing the methods of Sturm, Liouville, and Bdcher (n = 1). The exist¬ 
ence of infinitely many characteristic roots is established in the 
self-adjoint case and a general oscillation theorem is obtained. 

Around 1930 the new techniques auxiliary to the “variational the¬ 
ory in the large” opened up the separation-comparison-oscillation 
theory for self-adjoint systems of n second-order ordinary linear dif¬ 
ferential equations. The Sturm separation theorem was generalized 
by the theorem that the conjugate points on an arbitrary interval J 
of a point x=a of the x-axis differ in number by at most n from the 
conjugate points on J of a point x = b. The boundary conditions were 
expressed in terms of a minimum set (u) of parameters and an acces¬ 
sory quadratic form q(u ) defined in the second variation. A necessary 
and sufficient condition for the existence of infinitely many charac¬ 
teristic roots was obtained. 

In these terms it was possible for the first time to give a systematic 
classification of the possible separation, comparison and oscillation 
theorems, and with the aid of Morse’s “index form” to establish the 
principal results. In [9] Birkhoff and Hestenes turned from the older 
methods of Sturm to the new methods, adding an isoperimetric inter¬ 
pretation to the proofs. A detailed review of [9] is here impossible. 
Fortunately, a recent “hour address” by W. T. Reid® covers this field 
in an excellent manner both historically and mathematically, and the 
interested reader is referred to this address. 

In a short note [7] in 1917 Birkhoff gave a simple condition that a 
normalized set of orthogonal functions u t (x) be closed, and applied 
this to Sturm-Liouville series. The principle involved in the applica¬ 
tion could be stated as follows: any set of orthogonal vectors in a 
functional space lying near enough (in a sense defined by Birkhoff) 
to a complete set in that space is itself complete. Birkhoff was here 
sampling a great theory, a theory in which the name of his pupil and 
colleague Marshall Stone was later to become illustrious. 

Perhaps the most remarkable of the contributions of Birkhoff in 
the domain of separation and comparison theorems is in [s] and con¬ 
cerns an ordinary third order linear differential equation. This equa¬ 
tion does not lend itself to treatment by the Sturmian methods nor 
has it yet been effectively brought into the domain of the variational 
theory. A solution 

*(*) (* = 1, 2, 3) 

is thought of as a curve in a 2-dimensional projective space. Birkhoff 

.. * W : T - Re l id :, B Z’^ ary mlue t" oblems of ,h * wkulus of variations. Bull. Amer. 
Math. Soc. vol. 43 (1937) pp. 633-666. 
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utilizes the projective differential geometry of Darboux and Wilcyn- 
ski and by elegant methods far removed from all other methods 
arrives at theorems very similar to the classical ones. This part of 
the field deserves further study. 

II. Linear differential equations and the 

GENERALIZED RlEMANN PROBLEM 

With Gauss, Riemann, and Poincare leading the way, ordinary 
linear differential equations of the second order of Fuchsian type have 
been studied by almost every analyst of note. In mathematical 
physics, equations of this type bear the name of Bessel and Legendre. 
In complex function theory such equations lead by way of conformal 
mapping to the elliptic modular and automorphic functions. In real 
function theory integral relations such as 7 

J"{4(/)* - y»/**J dx > 0 (y(0) = 0) 

are typical of a class of relations which arise 8 from Fuchsian differ¬ 
ential equations with two singular points. Much of B6cher s research 
was concerned with these equations. It was natural therefore that 
Birkhoff should try his hand in this central field. 

Departing somewhat from the order in which Birkhoff's papers on 
this subject appeared we refer first to [ll) on A simplified treatment of 
the regular singular point. At a regular singular point the canonical 
solutions have singularities (if any) of a well known type. This fact 
had been previously proved by the method of Frobenius, whereby 
the coefficients of a formal solution were first obtained and the solu¬ 
tion then proved convergent. Birkhoff gives a direct proof, without 
use of a series expansion, and this proof is barely two pages in length. 

Birkhoff was not content with this achievement. Linear differen¬ 
tial systems with irregular singular points presented a field of great 
difficulty and next challenged his attention. 

No mathematician of Birkhoff’s day possessed greater facilitiy in 
combining formal algebraic methods with penetrating numerical 
analysis. Birkhoff needed all of this power in attacking the irregu¬ 
lar case. Thome was his predecessor in using formal solutions of a 
general type. Poincar6 and Horn 9 had effectively introduced asymp- 

7 Hardy, Littlewood, P6lya, Inequalities, p. 17S. 

• Marston Morse and Walter Leighton, Singular quadratic functionals, Trans. 
Amer. Math. Soc. vol. 40 (1936) pp. 252-286. 

9 See Birkhoff's papers for references. References which may be found in the papers 
of Birkhoff are generally omitted. 


XXX 



GEORGE DAVID BIRKHOFF AND HIS MATHEMATICAL WORK 365 


1946J 

totic representations. Birkhoff had used formal series and asymptotic 
representations in the study of boundary value problems. Hilbert 
and Plemelj, prior to Birkhoff and unknown to Birkhoff, had already 
solved one of the matrix problems relevant to the theory. Cunning¬ 
ham had generalized Poincare’s use of Laplace transformations in re¬ 
ducing the problem. 

There was thus a great technique available to Birkhoff when he 
began his research. But with all this power no one had carried the 
theory through to a triumphant end. Birkhoff in [10, 13] took up the 
problem with typical boldness. He added the concepts of canonical sys¬ 
tems of differential equations and of equivalence of such systems. He 
carried through the asymptotic representation of the matrix solution in 
the general irregular case, enumerated the essential u characteristics ” of 
a singular point of arbitrary rank, defined and solved the 11 generalized 
Riemann problem ” 

Such were Birkhoff’s achievements in this field. Although any ex¬ 
plicit formulation is complicated, it is needed if the reader is to sense 
the depth of Birkhoff’s mastery and skill. 

The differential equations can be represented in the form 

v dyi 

(2* 1 ) = a iMyj (i,j - 1, • • • , n) 

using the summation convention for repeated indices. Here z is a 
complex variable, and y,(z) the unknown ith component of a solution 
(y). In any local study the singular point can be taken as the point 
z= co. The functions «</(*) are assumed analytic at z= « except at 
most for poles. The maximum order of these poles is denoted by q 
and 2+1 is termed the rank of the singularity z= 00. When q< -1 
the singular point is ordinary. When q= -1 the point is regular, and 
when 2^0 irregular. The irregular case 2 = 0 had been reduced to the 
regular case q= -1 by Poincare on using a Laplace transformation 
of the variables (y), and Poincare had indicated a possible reduction 
of the irregular case 2>0. 

To simplify the system (2.1) Birkhoff uses linear transformations 
of the form 

(2 - 2) (i,j= l. •••,«) 

in which the functions Ctj(z) are analytic at infinity and have there 
a nonvanishing determinant. Any system into which the system (2.1) 
goes under (2.2) is said to be equivalent to the system (2.1) at infinity. 
Birkhoff shows that a system (2.1) of rank g+1^0 is equivalent to a 
canonical system 
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(2.3) zyl = q*Myj (*. i = 1, • • • , n) 

again a/ ran£ g-f-1 in which qu(z) is a polynomial of degree at most q + 1. 

In proving this theorem Birkhoff used a lemma on analytic mat¬ 
rices [12], Unknown to Birkhoff at the time, this lemma followed 
from general theorems of Hilbert and Plemelj. The canonical systems 

(2.3) are of great value in formally reducing the complexity of the 
system. 

With Thom6, n formal solutions each of the type 

(2.4) y< = (t = 1, • • • , n) 

are next determined. Here £,(z) is a formal series of nonpositive pow¬ 
ers of z such that at least one of the n constant terms (i = 1, 2, • • • , n) 
is not zero while p(z ) is of the form 

az*+ l Bz « 

(2.5) p(z) = - ~ - H-h * • • + hz. 

9+1 9 

The coefficient a is a characteristic root of the matrix of coefficients 
of z« +1 in the polynomials qi,(z). These roots a, are assumed distinct. 
For each of the n formal solutions there is thus a polynomial p(z), 
an exponent n, and a formal series B,(z). These solutions with 
j= 1, • • • , n may be chosen so that their determinant |^»,( co )| 5*0. 

By making generalized Laplace transformations of the variables y< 
in (2.3) together with other linear transformations of elementary 
character, Birkhoff shows that the rank of the differential system at 
its different singular points can be reduced step by step until the 
system is regular. This process leads to solutions in the form of gen¬ 
eralized Laplace integrals. Each of these integrals is asymptotically 
represented by a formal solution of the type (2.4) in an appropriate 
“sector” of a neighborhood of z= «>. These sectors are bounded by 
the m =*n(n — l)(g+l) rays on which (/?=real part) 

R[z*+'(ai - ay)] = 0 (i * j; i, j = 1, . • • , n). 

A base W r of these solutions can be chosen in the rth sector 
(r*=l, • • • , m) so that the base W r * belonging to the succeeding 
sector is obtained from the base W r by multiplying W r on the right 
by a matrix of the form /+C r , where I is the unit matrix and C r is a 
matrix with just one non-null element c r . 

These “transformation constants” c it • • • , c m , together with the n 
exponents m and the n(q+ 1) constants 

(2.6) (or,-, ft, • • • f X<) (i=l,***, n), 
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of which the ith set appears in the tth solution of a formal matrix 
solution, are termed by Birkhoff the characteristic constants of the 
singular point at infinity. With these constants given he shows that 
there exists a locally defined canonical system (2.3) to which these 
constants will be equivalent in the above sense. 

The preceding theorem completes the local analysis and Birkhoff 
turns next to the problem in the large. Let 5 be a differential system 
of the type (2.1) with at most a finite number of regular or irregular 
singular points 

(2.7) «!.•••#«* 
of ranks 

( 2 . 8 ) qi, •••,?* 

respectively. Corresponding to a base Y of solutions of S there exist 
matrices Ti, • • • f T k of constants such that Y is replaced by YTt 
when a positive circuit of a* is made. The matrices Ti generate the 
monodromic group G with a relation 

( 2 . 9 ) r,... r* = / 

corresponding to a circuit which encloses the points (2.7). 

Birkhoff says that two matrices K,(z) and Y 2 (z) whose elements 
are analytic in the vicinity of z—a are properly equivalent at z=a if 

(2.10) Y x (s) = A(z)Y 2 (z) (z*a) 

and A(z) is a matrix of elements analytic at z=a with \A(a)\ ^0, and 
improperly equivalent if (2.10) holds and the elements of A(z) are 
analytic at z = a except at most for poles. He then proves the follow¬ 
ing theorem: 

Let a lt • • • , a* be k given points. Let T lt • • • , T k be matrices of con¬ 
stants such that (2.9) holds. Let Zifz) (t = l, • • • , k) be matrices of 
functions, analytic of determinant not zero in the vicinity of z = a { and 
undergoing a transformation to Zi(a)T { as z makes a positive circuit of a 
There exists a matrix Y(z) of functions not zero forz^a h • • • , a k and 
analytic save at these points, which undergoes a transformation to YT t 
as z makes a positive circuit of as, furthermore Y(z) is properly equiva¬ 
lent to Zi at at (i = l, • • • , k — 1) and properly or improperly equivalent 
to Zt, at a*. 

This theorem contains as a special case a theorem of Hilbert and 
Plemelj in which the 77s are given but not the Z<’s, while Y{z) is 
affirmed to exist and possess elements which are analytic to finite 
order. The above theorem of Birkhoff gives a first type of generaliza- 
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tion of the Riemann problem. A second type of generalization which 
includes two problems proposed by Riemann is formulated by Birk¬ 
hoff as follows. 

To construct a system of linear differential equations of the first order 
with prescribed singular points (2.7) and with a given monodromic group, 
the characteristic constants being assigned for each singular point. 

In order that a solution exist it is necessary that the relation (2.9) 
hold and that the elementary divisors of 7\ be those of the trans¬ 
formation matrix determined at a, by the “transformation constants” 
belonging to a<. When these conditions are satisfied Birkhoff shows 
that the problem admits a solution provided one replaces the expo¬ 
nents m (i = 1, • • • , n) belonging to some one of the singular points 
by new exponents /x<-f s where 5 is a suitable integer independent of i. 

This concludes Birkhoff’s memorable work on differential systems. 
A byproduct was a number of theorems on the representation of ana¬ 
lytic matrices. BirkhofF returned to this subject in a paper [14] on 
Infinite products of analytic matrices where he generalized the theo¬ 
rems of Weierstrass and Mittag-Leffler on the representation of an 
analytic function by an infinite product. 

III. Linear difference equations 

In his study of the Riemann problem Birkhoff had mastered and 
extended the use of formal solutions and asymptotic representations. 
He had used the formal expansions not only to obtain asymptotic 
solutions but to find the invariant characteristics of the singular 
points and the a priori limitations on the monodromy group. This use 
of formal solutions was to remain characteristic of his mode of at¬ 
tack in at least three other main fields, linear difference equations, 
stability in dynamical systems, and surface transformations with an in¬ 
variant integral. The underlying analogies were great but the innova¬ 
tions which remained to be made in each case were much greater and 
would discourage most mathematicians. 

In his Fifty years of American mathematics [54] Birkhoff writes that 
it was the lectures of Van Vleck at Wisconsin on difference equations 
that led him to an appreciation of the open problems in this field. 
The thesis of Carmichael in 1911 under Birkhoff at Princeton was 
perhaps the first significant American contribution on difference equa¬ 
tions. Birkhoff’s own work began with [l5], published in 1911 in a 
volume of the Transactions that also contained Carmichael’s thesis. 
Birkhoff [15 ] dealt 10 with a system of n linear homogeneous differ- 

10 The writer is indebted to C. R. Adams for a summary of BirkhofTs work on 
difference equations. 
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ence equations of the first order rather than with a single equation 
of the nth order studied by Carmichael. The system was assumed reg¬ 
ular in the sense that the characteristic roots were simple and neither 
0 nor «. In matrix form the system took the form 

Y(x + I) = A(x)Y(x) (|.4(*)|^0) 


with A(x) a square matrix of rational functions. Two formal matrix 


solutions arose 


A-'(x)A~'(x +!)•••, 


A(x- l)A(x-2) • • • 


leading to two “principal solutions” Y+(x) and FL(x). These princi¬ 
pal solutions were analytic except for poles in the finite x-plane, and 
were asymptotically represented in the left and right hand planes re¬ 
spectively by formal solutions. Results similar in character to these 
had been established by Norlund and Galbrun a few months earlier 
using methods based on Laplace transformations. 

The matrices of constants which are used in the theory of linear 
differential systems are here replaced by matrices P(x) of functions 
with a period 1. In particular a matrix P(x) of this type was deter¬ 
mined such that 

Y-(x) = Y+(x)P(x). 


The “characteristic constants” of the system include the constants in 
P(x) and certain other constants defined by formal solutions in series 
form. The “generalized Riemann problem” was to determine a system 
of difference equations whose principal solutions have the given char¬ 
acteristic constants. Properly modified, this problem and its analogues 
for ^-difference equations were solved in [13]. 

The irregular case remained unsolved at this time although Adams 
had carefully examined the extent to which Birkhoff’s methods so far 
published were sufficient. In [16] (1930) Birkhoff presented a com¬ 
plete solution of the formal problem with strong analogies with the 
corresponding solution in differential equation theory. His last major 
paper [17] in this field was a significant collaboration with Trjitzinsky 
which extended and modified Birkhoff’s earlier methods introducing 
new devices adequate for the analytic treatment of the general ir¬ 
regular case. 

Birkhoff’s writings in this field occurred in two short periods seven¬ 
teen years apart. His interest in the field remained keen throughout 
his life as is shown by the doctoral theses written under him. His 
students in this field include R. D. Carmichael and K. P. Williams 
at Princeton; P. M. Batchelder, C. R. Adams, O. E. Lancaster, P. C. 


XXXV 



MARSTON MORSE 


370 


IM.y 


Guenther and Jeanne S. LeCaine at Harvard. The last two students 
received their degrees in 1941. 

IV. Dynamical systems 

As with Poincar6 the history of Birkhoff’s researches in dynamics 
is one of successes which are partly complete and partly incomplete. 
The grand aim was to give a formal normal reduction of a dynamical 
system which distinguishes equilibrium points as to stability, and en¬ 
ables one to pass from these forms to a complete qualitative charac¬ 
terization of the system. Hamiltonian and Pfaflian systems formed 
the central core with the restricted problem of three bodies [20, 32, 
39] a typical example. The formal aspects of Birkhoff’s research in 
dynamics may be regarded as relatively complete. Normal forms of 
great generality were obtained. 

On the qualitative side Birkhoff’s success was unconditional in 
many important phases. His central and recurrent motions belong to 
a theory which is valid for systems with any degree of freedom. His 
ergodic theorem is, however, conditioned in its usefulness by the un¬ 
certainty as to the hypothesis of metric transitivity. By far the 
greater part of his qualitative analysis was restricted to systems with 
two degrees of freedom, m= 2; this is inherent in his dependence on 
“regular surfaces of section” whose existence seems likely for m= 2 
but is in general unproved, and whose existence for m>2 is doubtful. 
Assuming metric transitivity and the existence of surfaces of section 
(m =2) he is able to reveal an astonishing range of ordered but intri¬ 
cate dynamical phenomena; in this he is close to realizing a complete, 
qualitative characterization. 

As did Poincar£, Birkhoff seeks to make the periodic orbits central 
in the theory. Poincare’s conjecture that these orbits are everywhere 
dense in phase space turns out to be false in special cases but is be¬ 
lieved true in general for m= 2. Poincare was stopped by the inade¬ 
quacy of his method of analytic continuation of these orbits. In a 
series of studies Wintner 11 and others have penetrated deep into the 
mystery, but in the general case much remains to be done. It is the 
conjecture of the writer that the method of topological continuation 12 
which Birkhoff does not use may be an essential key to the situation. 
We shall continue with a summary of various aspects of Birkhoff’s 
theory. 

“ A Wintner, Grundlagen einer Genealogie der pcriodischen Bahnen im reslringierten 
Dreikorptr Problem, Math. Zeit. vol. 34 (1931) pp. 321-349, 35CM02. 

12 See Marston Morse, The calculus of variations in the large, Amer. Math. Soc. 
Colloquium Publications, vol. 18, New York, 1934, chap. 9. 
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(a) Formal dynamics. In general form a dynamical system is locally 
defined by differential equations [26] 


(4.1) 


did 

— = F,{x i. •••.*») 

dt 


(*- !,•••.»). 


The functions F* are assumed analytic in the neighborhood of an ini¬ 
tial point (x°). Hamiltonian systems have the special form 

dpi dU dqi dH .. . 

< 4 - 2 > “ ^7’ 


where 7/ is a function of 2m variables 

(P\, ■ ■ ■ . pm-.qu ■ • • , ?-). 

Pfaffian systems are defined by the Euler equations of an integral 


(4.3) 


J [*/(*!» * * • » *lm)xf + Z(x 1 , • ‘ • , X im )]dt 


>n the variational problem in nonparametric form in the space 
(*,, • • • » xtm, /)• In order to be able to solve the Euler equations 
for the x/ it is assumed that the determinant of coefficients of these 
variables in the Euler equations is not zero. The Hamiltonian and 
Lagrangian equations also admit the Hamilton and Jacobi variational 
forms. 

Birkhoff speculates on the meaning of these variational forms. 
His views seem to have evolved somewhat with the passage of time. 
In [26] he points out that these variational forms have no “special 
significance” near a point which is not a point of equilibrium. At a 
point (*°) of equilibrium 


(4.4) 




a - 1 . 


in (4.1). He indicates the formal convenience of the variational form 
when a transformation of coordinates is to be made. He later shows 
[25] that the Hamiltonian form is one to which (4.1) can be formally 
reduced neighboring a point of completely formal stability of general 
type (to be defined later) provided the formal transformations used 
are sufficiently general in character. Finally in 1943 [44, p. 310] he 
writes as follows: 

“It is only the fact that there is a single explicit variational form 
available in the entire domain of the independent variables that is 
really significant. Possibly this interesting situation indicates that the 
basic importance of variational principles will be found to be topologi¬ 
cal.” (Italics are Birkhoff’s.) 
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The situation is relatively clear in ordinary dynamics. The writer 
wonders in what form topology will first appear in quantum me¬ 
chanics. 

Most of Birkhoff’s work is concerned with systems (4.1) which in¬ 
volve / in the functions F*. This occurs whenever a periodic 1 * orbit 
Xi=Xi(t) is given and one then makes a change of coordinates 

Xi = *<(/) + Si (* - 1, • • • t n). 

Starting with (4.1) in its original form this transformation leads to a 
new function F% of (£) and of / as well, which vanishes at the origin 
in the space (£) for all values of t. The origin (£) =0 is then termed a 
point of generalized equilibrium. 

Neighboring such a point the modified system (4.1) is subjected 
to formal transformations in which x, is replaced by a formal series 
in new variables y, with coefficients which are analytic and periodic 
in t. These coefficients are permitted to have complex values but in 
such cases the transformation is to be such that the new variables y< 
can be arranged in pairs which are formally conjugate in terms of the 
original real variables. In the case of the Hamiltonian system the 
new variables (/>,, q f ) are taken as conjugate. 

Corresponding to an equilibrium point the equations 14 of variations 
of (4.1) can be brought to a canonical form. In the so-called general 1# 
case this canonical form is 

dy j 

(4.5) -= X/yy (j not summed = 1, • • • , n) 

dt 

where the X/s satisfy no relation 

tf*iXi + • • • + *w*X n + m„fi(— 1) 1/2 = 0 

in which the m/s are integers not all zero. In the case of the Hamil¬ 
tonian or Pfaffian system the multipliers can be grouped in m pairs 

(Xi, Xj), • • • , (X mi X m ) 

where the X/s are either real or pure imaginary. With this under¬ 
stood a prime result of Birkhoff is as follows [26]. 

By suitable formal (generally divergent) transformations a Hamilto¬ 
nian or Pfaffian system with a generalized equilibrium point of u general” 
type at the origin may be taken into a normal Hamiltonian form in which 

u It will be convenient to take this period as 2r throughout. 

14 Involving the terms of first order in (4.1). 
u The “general” case is referred to throughout in this sense. 


xxxvui 



GEORGE DAVID BIRKHOFF AND HIS MATHEMATICAL WORK 


1^46! 

H(p, q) « a formal power series in the products (/>,<?,) starting with the 
quadratic terms 

(4.6) liPAi 0* = 1. - • , m). 

The case in which H reduces to the quadratic terms (4.6) is termed 
degenerate. Birkhoff is largely concerned with a “generalized equilib¬ 
rium point” of “general” type with a “non-degenerate” Hamiltonian 
function H, as these terms have been defined. 

The representations (4.1) which have been described are local. The 
whole of “phase space” is supposed “covered” by a finite number of 
such local representations each defined in a region of a space of pa¬ 
rameters (x). The phase space M is here supposed to be compact. A 
transformation T of M is defined in which each point P corresponds 
under T to the point P t on the trajectory through P. The origin for 
t is so selected that P 0 -P. This one-parameter family of transforma¬ 
tions T is called a. flow. In the case of certain systems (that is, Hamil¬ 
tonian systems) there exists an integral of the form /( V) where V is a 
measurable subset of M such that 

m-KW)) 

where T(V) is the image of V under T. The integral f(V) is then 
termed invariant and the system conservative. The integrand of / is 
supposed positive and may ordinarily be taken as analytic. 

Various types of stability are met, and these must be carefully dis¬ 
tinguished. Historically, one of the most important is the property 
of a conservative system whereby the images of any open region R 
under the flow intersect R for values of t which cluster at «>. 

First order formal stability is defined relative to a generalized equi¬ 
librium point of a Hamiltonian or Pfaffian system. It requires that 
the multpliers X, be purely imaginary. 

Formal trigonometric or complete stability occurs as follows. Let T 
be a time interval, m any positive integer, P , any polynomial in (x) 
with coefficients analytic and periodic in t and of lowest degree s. 
It is then possible, for e>0, to approximate P„ for time intervals 
less than T, with an error numerically less than Me m+I with a suitably 
chosen sum, 

N 

£ (A i cos k jt + B, sin kjt) (ki — kj > k > 0 ) 

y-o 

in which M, N, k are constants dependent only on m and P, (^o = 0). 

(A) It is a capital result of Birkhoff that first order formal stability 
of Hamiltonian or Pfaffian systems in the case of a point of “generalized 
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equilibrium” of “ general” type implies formal trigonometric stability 
[26]. 

(b) Recurrence and transitivity. A subset S of the phase space M 
which is carried into itself by the “flow” on M is called invariant. A 
point P of S is called wandering relative to 5 if there exists a neighbor¬ 
hood W of P relative to S which, for some /o>0, and for t>to, has no 
image which intersects W. A maximal invariant subset S* of S whose 
points are non-wandering relative to S* is called central in S. BirkhofF 
shows by transfinite induction that every non-empty invariant sub¬ 
set S of M possesses a central subset S*. Such a set 5* is closed rela¬ 
tive to S and non-empty. If the system is conservative as defined 
earlier, the set which is central relative to M is M; but if M is non¬ 


conservative, the central set M * of M is in general a proper subset 
of M. Let N be an arbitrary neighborhood of M* on M. The fraction 
of time which the finite arc PoP* spends in N tends to 1 as / becomes 
infinite, independently of the choice of Po on M [26]. 

A closed invariant subset of M no proper subset of which is a 
closed invariant set is called minimal and any sub-motion (complete 
trajectory) of the set is called recurrent [18 ]. A periodic motion is a 
very special minimal set. For a motion P* to be recurrent it is neces¬ 
sary and sufficient that for every e>0 there exists a P>0 so large 
that any arc P,Pi+r of the motion has points within a distance e of 
every point of the motion. Limit points of a motion P t as l becomes 
negatively or positively infinite respectively are called a- and w-limit 
points respectively. Birkhoff shows that the a - (or w-) limit motions 
of a given motion include at least one minimal set. In general, mini¬ 
mal sets do not reduce to a periodic motion. Recurrent motions on 
surfaces of negative curvature can be studied and classified with the 
aid of the topological symbolism of Hadamard, l6 as Morse first showed 
in his thesis. 


In his desire to determine the ultimate significance of formal sta¬ 
bility of Hamiltonian or variational forms, Birkhoff goes beyond the 
case of Hamiltonian or Pfaffian systems. In such general dynamical 
systems (A) is not true. Birkhoff defines even more general formal 
transformations and states that if these be admitted, any dynami¬ 
cal system can be formally reduced near an equilibrium point of gen¬ 
eral type to Hamiltonian form provided the conditions of first order 
and formal trigonometric stability are satisfied [25]. The meaning of 
this bold step is not altogether clear. 

Departing from the domain of formal operations, permanent stabil- 

“ For a general theory see Marston Morse and Gustav A. Hedlund, Symbolic 
dynamics, I and II, Amer. J. Math. vol. 60 (1938) pp. 815-866; vol. 62 (1940) pp. 1-42. 
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ity is said to occur when sufficiently small initial displacements at 
/==/„ from an initial solution cause displacements from the initial 
solutions which are arbitrarily small for all time. Formal conditions 
for permanent stability are not known. In particular it is not known 
whether or not a point of generalized equilibrium which is formally 
“completely” stable is in general permanently stable. The theory in 
the large is tied up with the theory in the small because the validity 
of the hypothesis of metric transitivity in a system would imply that 
permanently stable periodic orbits do not exist. 

A motion is termed transitive if it is everywhere dense in the phase 
space M. If such a motion exists the system is called topologically 
transitive. Birkhoff exhibits examples of such transitivity. The most 
general case of topological transitivity established up to this time in 
dynamical systems of classical type is that of geodesic motion on a 
surface of genus greater than 1 on which no point has a conjugate 
point. 17 

Under metric transitivity , as first defined by Birkhoff and P. A. 
Smith [27], the only invariant subsets of M are sets whose measure 
isO or m(M). Metric transitivity implies topological transitivity. The 
converse is probably true in analytic systems or systems with some 
degree of analytic regularity. A proof or disproof of this converse is 
much needed. 

Smith and Birkhoff [27] have defined central motions of iterates 
of a surface homeomorphism and characterized them in various ways. 
Metric transitivity is shown by them to be equivalent to the condi¬ 
tion that no two “invariant integrals” on 5 be linearly independent. 
In the general analytic case at least two central motions arc shown 
to exist. 

B. 0. Koopman [30] discovered that a conservative flow T t on M 
with an invariant measure on M could be represented by a one- 
parameter family of unitary transformations in Hilbert space, thus 
opening up a new mode of investigation of dynamical systems. Fol¬ 
lowing this, von Neumann established his “mean ergodic theorem” 
appropriate to Hilbert space. Under the stimulus of these ideas, 
Birkhoff [29] saw that in treating transitivity open sets should be 
replaced by sets of positive measure. Birkhoff's first theorem is what 


17 Marston Morse and Gustav A. Hedlund, Manifolds without conjugate points. 
Trans. Amer. Math. Soc. vol. 51 (1942) pp. 362-386. For other aspects of the problem, 
see J. C. Ox toby and S. M. Ulam, Measure preserving homeomorphisms and metrical 
transitivity, Ann. of Math. vol. 42 (1941) pp. 874-920, and P. R. Halmos, In general 
a measure preserving transformation is mixing, ibid. vol. 45 (1944) pp. 786-792. 
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is commonly known as his ergodic theorem. We shall give a formulation 
due to Khintchine. 18 

The space M is assumed to have a finite measure m invariant under 
the flow. Let } be integrable over M and let P be a point of M. Then 

tA ,. JofiPddt 

(4.7) lim- 

T 

exists for almost all P on M. 

Extensions and applications of the ergodic theory have been made 
by Wiener, Wintner, E. Hopf, Garrett Birkhoff and others. If / is 
the characteristic function of an arbitrary measurable subset V of M, 
the integral in (4.7) is the measure of the time spent by P t in Kfor 
0£/£7\ and, in case the system is metrically transitive, Birkhoff 
shows that the limit in (4.7) equals the ratio of m(V) to m(Af) for 
almost all P on M. 

(c) Periodic orbits. In the case of a Hamiltonian or Pfaffian system 
with two degrees of freedom and with a periodic orbit X (period 2ir) 
one can use an integral to reduce the dimension of the phase space 
containing X to n =3. Let 5 be an element of a regular analytic “sur¬ 
face of section” which cuts across X at some point Q. If P is a point 
of S sufficiently near Q the orbit P t will cut S again later at a point 
T(P). PoincarS introduced these transformations T oi S into itself 
in order to establish the existence of periodic orbits in the neighbor¬ 
hood of X. It is clear that a necessary and sufficient condition that P 
represent an orbit with a period near 2ir is that 

P = T(P). 

One can similarly use the £th iterate T k (P) of the transformation 
T(P) to find orbits near X with periods near 2kir. 

The orbit X is termed simple if its equations of variation have no 
nonzero periodic solution. If X is simple it cannot be a member of 
an analytic family of periodic orbits, and, bearing in mind no doubt 
that this excludes systems classically called “integrable,” Birkhoff 
terms the problem non-inlegrable if every periodic orbit and its multi¬ 
ples are simple [26]. 

A generalized equilibrium point has been termed degenerate if the 
Hamiltonian function H can be formally reduced to quadratic terms. 
A system in which each periodic orbit is nondegenerate is termed non¬ 
degenerate. Non-integrable nondegenerate systems exist as Birkhoff 
shows. Let (u, v) be regular coordinates on 5 neighboring a fixed 

,s A - Khintchine, Zu Birkhof s Lbsung <Us ErgodcnprobUms, Math. Ann. vol. 107 
(1933) pp. 485-488. 
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point (0, 0) of T. If the system is non-integrable and nondegenerate 
the terms of first order in T reduce either to a rotation about (0, 0) 
through an angle incommensurable with 2ir (fixed point of elliptic 
type) or to the form 

u' = pu, v' = — v (p ^ 0 or 1) 

P 

where p is real (fixed point of hyperbolic type) [23]. 

Following Poincare, Birkhoff makes use of the existence of an in¬ 
variant integral in the original phase space to obtain an invariant in¬ 
tegral [23, 33] 

ff Q(U. v)iudv (Q > 0) 

on S neighboring the origin, where Q is real and analytic in ( u , v). 
Birkhoff shows that there exists a real power series F(u, v) which 
may be convergent or divergent, and is formally invariant under T. 
The series F starts with terms of the second degree or higher. The 
series Q and Fare formally related by differential equations 


du 

Q Tk mF - 


dv 

Q 7k‘- F ' 


whose formal solutions u and v starting with terms of the first order 
yield T k when k is replaced by an integer. 

Formal series transformations of u and v reduce T to a transforma¬ 
tion 


(4.8) 


u\ -f ivi = (« + iv)e K{r)i (h(r) = a + cr* m ) 


when X is simple and nondegenerate and the fixed point (0, 0) elliptic- 
Here a and c are real constants with cj *0 and a incommensurable 
with 2 t. The exponent m is a positive integer in general 1. Whenever 
(4.8) holds, T is actually as well as formally very close to a rotation 
through an angle a, for sufficiently small positive r. For a simple 
nondegenerate orbit and a hyperbolic fixed point, T has the normal 
form 

1 


u i = pue 


c(uv)’ 


Vi = — ve~ eluv) (m > 0; c ^ 0) 
P 


where p is real and neither 0 nor 1. The use of these forms to obtain 
approximate representations of T is reminiscent of Birkhoff’s use of 
asymptotic representations in his thesis, in the Riemann problem and 
in the formal Hamiltonian theory. 



378 


MARSTON MORSE 


(May 


In [33 J Birkhoff largely removes the condition that the orbit be 
“simple” although this task is one of great formal complexity. When 
the periodic orbit is “multiple” the terms of first order in the repre¬ 
sentation of T have equal characteristic roots. In such special cases 
the corresponding fixed point Q oi T \s termed hyperbolic if there 
exists a real invariant curve passing through Q. Otherwise Q is termed 
elliptic. 

To establish the existence of periodic orbits in the restricted prob¬ 
lem of three bodies Poincar6 introduced his celebrated “last geometric 
theorem” but was unable to prove it. 19 This theorem may be stated 
as follows: 

Given a ring 0 <a^r £b in the r, 0, plane and a 1-1 continuous area¬ 
preserving transformation T of the ring under which points on r=a ad¬ 
vance, and those on r = b regress, there will exist at least two points of the 
ring invariant under T. 

A number of mathematicians had attempted to prove this theorem 
and at least one erroneous proof was published. Urged on by his col¬ 
leagues, Birkhoff [l9] gave a beautiful proof of the theorem that 
went directly to the heart of the problem. Modified forms of the 
theorem were also presented by Birkhoff [24]. 

Using theorems of a nature similar to that of Poincard, Birkhoff 
proves the following in [26]: 

(a) In non-integrable, nondegenerate Hamiltonian systems with two 
degrees of freedom any neighborhood of a periodic orbit of elliptic type 
includes infinitely many periodic orbits both of the elliptic and hyper¬ 
bolic types. At most a finite set of these orbits have periods less than a 
given constant. 

A generalization of the PoincarS theorem to higher dimensions is 
given in Birkhoff [28]. This generalization employs the theorem of 
Morse that the number of critical points, suitably counted, of an 
analytic function defined on the product of n-circles is 2 n . With D. C. 
Lewis, Birkhoff’s theorem is applied in [31 ] to establish the existence 
of periodic orbits neighboring a periodic orbit of general, formally 
stable type. 

A periodic orbit X of hyperbolic type cannot be permanently stable. 
When X in a is hyperbolic there exist [23] two analytic curves on 5 
passing through (0, 0) and invariant under T{P). These invariant 
curves on 5 imply the existence of analytic families of motions asymp¬ 
totic to X in either sense. 

No example is known in which the elliptic orbit X of (a) is “perma- 


19 H. Poincarl, Rend. Circ. Mat. Palermo vol. 33 (1912) pp. 375-407. 
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nently stable” in the geometric sense and none such could exist if the 
system is metrically transitive. Nevertheless Birkhoff gives a rela¬ 
tively complete description of motions neighboring X in case X is in 
fact permanently stable. Since this description is similar to the one 
given by Poincare, it is omitted. 

A priori existence theorems for periodic orbits are very necessary. 
Birkhoff turns to the “direct” method in the calculus of variations, 
and uses the Jacobi “action integral” J. In Lagrangian problems 
in which the constraints are independent of the time this integral 
will in general be regular and positive definite in the sense of classi¬ 
cal variational theory. A periodic minimizing extremal will then 
exist in an “extremal convex” region. No periodic minimizing ex¬ 
tremal exists in very general problems, as for example in the restricted 
problem of three bodies with a parameter sufficiently small. To fill 
in this gap in the theory Birkhoff applied a “minimax principle” [21 ]. 

In Birkhoff’s applications this principle reduces to an existence 
theorem for critical points of an analytic function F(x ) of n-variables. 
If one supposes for the sake of definiteness that F is defined over a 
regular, compact, analytic manifold, then, suitably counted, there 
exist at least R\ + Mq — 1 generalized saddle points, where Ri is the 
linear connectivity of the manifold and A/ 0 the number of points 
(supposed isolated) of relative minimum of F. In similar or related 
forms this principle was known and applied by Poincar6, Maxwell, 
and Kronecker, and has an origin even more remote in the past. 
Birkhoff’s bold step was to conceive of its application to functions of 
curves such as the integral J. He applied it in the billard ball problem 
[26] (motion on a convex table) and to obtain closed geodesics on a 
convex surface. On slipping an elastic band over the surface from one 
egg-shaped end to the other the process that requires the least stretch¬ 
ing leads to a closed geodesic of minimax type. However, Birkhoff’s 
conjecture that the closed geodesic analogously obtained for a topo¬ 
logical n-sphere is of “minimax type,” as characterized by him, is 
not true if n>2 . 

The a priori existence of periodic orbits is necessary for the dynami¬ 
cal theory as conceived by Poincare and Birkhoff. 

(d) General qualitative dynamical theory. This theory is limited al¬ 
most wholly to the case of two degrees of freedom m = 2 largely 
because a fundamental tool with Birkhoff as with Poincare is a “regu¬ 
lar surface of section.” One presupposes a three-dimensional, analytic, 
compact, nonsingular phase space. A regular surface of section S is 
then an analytic surface crossed in the same sense by all trajectories 
without being tangent to 5 except along the boundaries. These bound- 
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aries are to be finite in number and consist of closed trajectories. 
Further every trajectory except the boundary orbits shall cut S at 
least once in any sufficiently large fixed interval of time. 

It seems likely that such surfaces of section exist in general when 
tn=2. As Poincar6 indicated, such a surface exists in the restricted 
problem of three bodies when the mass p (that is, earth) is sufficiently 
small. Although open hyper-surfaces of section exist when m >2 it 
has not yet been found possible to make profitable use of them. 

As in (c) the “flow” in the phase space defines and is characterized 
by a conservative transformation T of S into itself. Birkhoff makes 
three tentative hypotheses none of which has been proved to hold in 
general, but each of which probably does so hold. These hypotheses 
as given in [33] are: 

(A.l) The dynamical system is transitive. 

(A.2) A regular surface of section S exists. 

(A.3) There is at least one point which is fixed under T, but any 
such point, if elliptic, corresponds to a nondegenerate, periodic orbit. 

This qualitative theory culminates in the paper [33] 130 pages 
long, crowned by the Pontifical Academy. This paper resumes and 
extends much of Birkhoff’s earlier dynamical theories. Upon finishing 
this work Birkhoff commented to the author that it was an exhausting 
task. In this connection it is of historical interest to recall that Birk¬ 
hoff stated about 1925 that he considered his Transactions paper [21 ] 
for which he received the Bdcher prize as good a piece of research 
as he would be likely to do. In the writer’s opinion, the Rendiconti 
memoir [20], also a prize paper, should be placed near the top of 
Birkhoff’s works, not because it is final but because one finds Birkhoff 
there first meeting the problems which he inherited from Poincar6, 
and taking the first clear cut concrete steps which he later generalized. 

The Pontifical memoir can only be broadly summarized. After ex¬ 
tensive formal studies already reported on, Birkhoff shows that under 
hypotheses (A) there exists an infinite number of hyperbolic points 
fixed relative to some of the iterates T k of T. There may exist no 
elliptic points, but if one exists, there exist infinitely many such ellip¬ 
tic points. The general existence of the hyperbolic fixed points makes 
them a suitable instrument on which to base a qualitative character¬ 
ization of T and this Birkhoff proceeds to do. 

On 5 the invariant curves C through a hyperbolic fixed point P 
are divided into a-branches E a (P) and co-branches E U (P) according 
as the iteration of T~ l or T makes a point on C tend toward P on C 
as a limit. In the original phase space each point on an a-branch 
(co-branch) defines a motion which becomes negatively (positively) 
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asymptotic to the periodic orbit determined by P. Returning to S, 
Birkhoff shows that under hypotheses (A) these a- and w-branches 
associated with a single hyperbolic fixed point P, if continued ana¬ 
lytically, are everywhere dense on S. There is an exception which does 
not occur for most fixed points P when one of these branches belongs 
to a second hyperbolic fixed point. These a - and w-branches intersect 
in infinitely many points termed “homoclinic” by PoincarS. Homo¬ 
clinic points are proved to be cluster points of fixed points. 

In the case of a metrically transitive system the set of all fixed 


points and of all a- or w-branches attached to hyperbolic fixed points 
has a measure zero on the surface of section. The remaining points 
are grouped into sets 2 0 termed isomorphic by Birkhoff. Let Q be a 
point which belongs to no set E a (P) or E U (P). The maximal con¬ 
nected set of points which contains Q but no point of a set E a (P) 
or E U (P) is independent of P among hyperbolic fixed points P and is 
denoted by 2 q. These sets are transformed into each other under T, 
hence the term isomorphic. It is probable that 2 Q = @ in general ; this 
case is called regular. 

Birkhoff shows in what sense the intersections of an E a (P) and an 
E m (P) “characterize” the isomorphic sets 2 0 . He invents a 2-dimen- 


sional symbol called the signature of the system which displays the 
topology of the intersections of E a (P) and E„{P). For Birkhoff such 
a signature is the ultimate in the qualitative description of a dynamical 


system. 


These a - and w-branches belonging to a hyperbolic point P are 
properly called asymptotic to P. Birkhoff also defines connected sets 
asymptotic to elliptic points. These are sets and not curves, at least 
a priori. They were studied at length by Birkhoff before he recognized 
the importance of the a- and w-brariches belonging to hyperbolic 
points. He indicates the formal analogies between the sets asymptotic 
to elliptic points and the a- and w-branches attached to hyperbolic 
points. 

(e) The restricted problem of three bodies. This famous problem was 
studied by Birkhoff in three principal papers [20, 32, 34]. Much of 
the work in these papers is an application or exposition of general 
theories elsewhere reported. There remain several special achieve¬ 
ments to be noted. 

The two finite bodies (that is, sun and earth) have masses 1 —y. 
and n respectively, and the third body ( x , y) has a mass which is 
infinitesimal. The two principal bodies are originally supposed to ro¬ 
tate about their common center of gravity in the clockwise sense. 
In the representation of the problem in the ( x , y)-plane the coordi- 
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nates have been so transformed that the earth and sun rest fixed at 
points on the x-axis with their center of gravity at the origin. The 
problem then has the Jacobi integral 

where 12 is a positive function of special nature. 

When C is sufficiently large the point (x, y) is confined in “Case I” 
to an oval about one of the finite masses. The phase space is then 
shown by Birkhoff to be the homeomorph of the three-dimensional 
projective plane. Topological models are given for the phase space 
for other values of C. Birkhoff keeps to Case I in general. 

Direct periodic orbits had been established by PoincarS only for 
sufficiently small values of /i. Birkhoff shows that in Case I at least 
one retrograde periodic orbit symmetric with respect to the x-axis 
always exists. He replaces the ring transformation of Poincar6 which 
led to the “last geometric theorem” by a transformation of a diskoid 
into itself. As long as this construction is possible there exists a direct 
periodic orbit. But such a construction is definitely established only 
for sufficiently small values of m- However the existence of the diskoid 
seems to demand less by way of proof than the existence of the 
Poincar6 ring. 

Returning to the ring transformation of Poincar6, Birkhoff con¬ 
cerns himself with the existence of orbits which are symmetric with 
respect to the x-axis. He shows that the transformation of the ring 
surface of section 5 into itself is the product of two involutory trans¬ 
formations. He attaches two characteristic integers to a symmetric 
periodic orbit as follows. Suppose that this orbit cuts 5 at a point P. 
If T represents the ring transformation of Poincar6 there is a least 
integer k such that the T k (P)=P . The ^-iterates of P on S will rotate 
about the PoincarS ring / times. The pair ( k, l) characterize the orbit. 
Birkhoff shows that there are infinitely many pairs ( k , /) which repre¬ 
sent symmetric periodic orbits and he exploits this symbolism to the 
full. 

After reading the preceding report on Birkhoff's advances in dy¬ 
namics the reader may find it of interest to compare a summary [22] 
in 1920 and a prospectus [35] in 1941 by Birkhoff himself. The sum¬ 
mary occurs under the title Recent advances in dynamics and the 
prospectus is entitled Some unsolved problems of theoretical dynamics 
and was presented at a fiftieth anniversary symposium at the Uni¬ 
versity of Chicago in 1941. After a lapse of twenty-one years the prob¬ 
lems remain essentially the same, but the approach and particularly 
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fhp language of approach has become more topological and abstract. 
“Conservative flows” are to be studied both in the topological and 
the statistical sense, and abstract variational theory is to enter. There 
is no doubt about the challenge of the field, and the need for a power- 
ful and varied attack. 


V. Physical theories 20 

The earliest evidence of BirkhofF’s interest in relativity can be 
found in a review of Books on relativity [36], in 1922. In a few words, 
while commenting upon the development of Einstein’s general theory, 
he formulates what has turned out to be his own ideal in the creation 
of physical theories: “In the first place, it illustrates afresh the im¬ 
portance of taking the simplest possible case as an abstract basis of 
departure. Secondly, Einstein uses mathematical analogy in passing, 
step by step, from the simple universe of the special theory to the 
most general universe, and at each step the mere sense of mathemati¬ 
cal form is sufficient to point the way to a natural generalization. 
The mathematician may feel satisfied that the formal analogies sup¬ 
plied by classical dynamics and four-dimensional geometry furnish 
the very basis by which Einstein’s generalization proceeds” (page 
217 ). 

In 1923, in his widely acclaimed monograph, Relativity and. modern 
physics (written with the cooperation of R. E. Langer), BirkhofT car¬ 
ries out this ideal as closely as he can by establishing the electromag¬ 
netic equations, and the fundamentals of the special and general the¬ 
ories of relativity, showing how this can be done without indiscriminate 
appeal to physical intuition, and with a maximum of regard for geo¬ 
metrical symmetry, mathematical rigor and simplicity. 

Feeling that “without a true model as a starting point, it does not 
seem likely that a final conception of the physical universe can be 
arrived at” [37, p. 70], BirkhofT selects a “perfect fluid,” which satis¬ 
fies the classical equations of hydrodynamics, slightly modified and 
unified so as to be invariant under the Lorentz transformations. The 
pressure-density relation was such that all disturbances would be 
propagated with the speed of light. This fundamental property of the 
substance of which all matter is composed was intended to eliminate 
those paradoxes in the theory of collision which BirkhofT pointed out 
in his presidential address of 1926 [40 ]. 

In this address, BirkhofT urged theoretical physicists to attempt to 

10 This account of Birkhoff’s physical theories has been written by Morse’s assist¬ 
ant Dr. Richard Arens, who had the advantage of numerous conversations on this 
subject with Birkhoff in the last years. 
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account for atomic phenomena using mainly the ideas of special rela¬ 
tivity and abstract dynamics, for he felt that the recurrence proper¬ 
ties of dynamical systems could eventually provide an intuitive ex¬ 
planation of quantum effects (cf. also [42, p. 317]). He made a modest 
attempt in this direction himself [39, 40]. He welcomed the return 
to the use of differential equations, due to Schrodinger, and justified 
these equations, from a general point of view, in [41 ]. 

Although it soon appeared that the “perfect fluid” could not be 
limited to a desirable behavior by its hydrodynamical equations alone 
[42, p. 324], Birkhoff retains it as raw material for “mathematical 
model-making,” perhaps because he feels that any physical theory 
should contain some statement about the constitutive equations of 
“matter” [44, p. 301 ]. The best cross section of Birkhoff’s ideas after 
ten years of contemplation can be obtained from his Franklin Insti¬ 
tute Lecture of 1938, Electricity as a fluid [43], in which he states his 
ideas, opinions, and preferences with unusual directness, clarity and 
humor. 

The final phase, which was undoubtedly intended to be only the 
beginning, of Birkhoff’s work in relativity was his gravitational theory 
[43] of 1943. With such a degree of analogy to electrodynamics as 
the essence of things permits, this theory involves a gravitational 
tensor potential governed by a linear differential equation which is, 
as seems proper in view of the underlying flat but Minkowskian 
space-time, of hyperbolic type. Whereas in electrodynamics the ac¬ 
celeration of a charged particle is.a linear function of the velocity with 
coefficients characterising the field, here the acceleration depends 
quadratically on the velocity. The theory is completely linear, and 
homogeneous except for the presence of the “perfect fluid.” 

As regards experimental verification, Birkhoff’s theory predicts the 
same rate of rotation of the apse line of planetary orbits as does the 
general theory of relativity. Recently, moreover, a practical method 
of successive approximation for treating the two body problem has 
been worked out. 21 

In Birkhoff’s theory, phenomena such as the bending of light 
around the sun cannot be regarded as of electromagnetic nature, 
since there is no interaction of electromagnetic and gravitational 
field; but Birkhoff finds that by considering the world lines of 
“photons” as limits of world lines of particles on which the speed ap¬ 
proaches that of light, the proper magnitudes for the bending of light 
and red-shift effect can be obtained [44a]. He has also discovered 

,l C. Graef Fernandez, El movimiento <U los dos cuerpos en la teoria de la gravitacidn 
dc Birkhoff, Boletin de la Sociedad Matemitica Mexicana vol. 1 (1944) p. 25. 
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another approach to the red-shift effect which makes use of the wave- 
length-energy relation of Planck. These ideas merit careful investiga¬ 
tion even apart from the remaining theory. 

It has been stated 22 that this theory does not provide for the iden¬ 
tity of gravitational and inertial mass in as effortless a manner as 
the theory of Einstein, and this seems to be a defect from which 

linear theories must suffer. 23 . , . 

Birkhoff inherited from PoincarS the sentiment that no single math¬ 
ematical theory of any phenomenon deserves the exclusive attention 
of physicists, or at least of mathematicians. 

VI. Miscellaneous works 

(a) Fixed points in function space. The paper [45] written jointly 
with 0. D. Kellogg is undoubtedly one of Birkhoff’s most important 
contributions. The first theorems are generalizations of the Brouwer 
theorem that a continuous transformation of an n-disc into itself has 
at least one fixed point. This is applied to a transformation of a space 
of uniformly bounded and equicontinuous functions /($) and of the 
space of functions f(s) which are summable square and representable 
by a subset of points in Hilbert space for which converges uni¬ 

formly. Fixed elements f(s) are inferred. 

A second point of departure is the theorem that a continuous trans¬ 
formation of a real projective space of even dimensions into itself 
has at least one fixed point or “invariant direction.” This theorem 
extends to integral equations. 

Deformations of transformations are extensively used. A continu¬ 
ous transformation 7\ on an n-sphere H n into the embedding (n + 1)- 
space is given as depending linearly on a parameter X in such a 
manner that as X varies to 0, 7\ deforms into the identity without 
intersecting the origin. It is concluded that to each point (b) on II n 
corresponds a point (a) (“inverse direction”) on II n whose image 
under T\ is on the same ray as (b). This theorem has applications in 
the Fredholm theory. 

It is of interest that the paper in the Transactions preceding [45] 
was an elegant treatment by Alexander of “transformations with in¬ 
variant points,” including many of the theorems in [45] concerning 
transformations of finite-dimensional spaces but not including appli¬ 
cations to function spaces. The work of Birkhoff and Kellogg was the 

n H. Weyl, Comparison of a degenerate form of Einstein's with Birkhoff's theory of 
gravitation, Proc. Nat. Acad. Sci. U.S.A. vol. 30 (1944) pp. 205-210, especially p. 205. 

33 Loc. cit. p. 206. 
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acknowledged stimulus of the noteworthy extensions of Schauder, 
and later of Schauder and Leray, including important applications to 
partial differential equations. The invariance of the Brouwer degree 
of a transformation under continuous deformations is a unifying ele¬ 
ment in the Schauder-Leray theory. 

It is of interest to recall that Birkhoff’s introduction to fixed point 
theory was by way of the lemma of Poincar6 that the sum of the 
signed indices of an analytic vector field on an orientable surface of 
genus p is 2 — 2 p. Birkhoff used this theorem in his theory of elliptic 
and hyperbolic fixed points of the surface transformations arising in 
dynamics [21 ]. 

(b) The 4-color problem. u Birkhoff’s interest in this problem was 
aroused in Veblen’s seminar in “analysis situs” during his years at 
Princeton. Birkhoff brought out two papers [46, 47] on this subject 
during his stay at Princeton. 

In [47] Birkhoff revived the qualitative approach based on Kempe 
chains. He found certain “reducible configurations,” that is, figures 
whose presence in a regular map reduces the coloration problem to 
that of a simpler map. Of these the most important and the hardest 
to analyze was a ring of two regions not surrounding a regular penta¬ 
gon. Of such reducible configurations later found by Franklin, Errera, 
and Winn, many are direct extensions of those of Birkhoff and most 
require for their proof Birkhoff’s theorem on rings of five regions. The 
minimum number of regions in a map not capable of coloration in 
four colors was shown successively to be at least 25 (1922, Franklin), 
27 (1927, Reynolds), 31 (1938, Franklin), and finally 35 (1940, Winn). 

In [46] Birkhoff introduced the quantitative approach. “Chromatic 
polynomials” P(x) equal to the number of ways a given map can be 
colored in x colors were found. Although the main objective of show¬ 
ing that P(4) >0 was not achieved, many properties of P(x) were 
obtained by Birkhoff [46, 48] and later by Whitney whose thesis 
was on this problem. Birkhoff hoped that the theory of chromatic 
polynomials could be so developed that methods of analytic function 
theory could be applied. Birkhoff took the first real step in this direc¬ 
tion in [49]. 

The quantitative approach was not as successful in the study of the 
classical problem as the qualitative. Nevertheless Birkhoff felt that 
it might be easier to establish a stronger result concerning P{x) than 
a direct and weaker theorem of qualitative type. In a long joint paper 
with D. C. Lewis, submitted to the Transactions, a fusing of the quali- 

u The writer is indebted to Philip Franklin and to D. C. Lewis for summaries of 
work on the 4-color problem. 
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tAtive and quantitative approaches is sought by way of a study of 
P(x) borrowing and modifying the method of Kempe chains. In this 
study the reducibility of certain configurations previously established 
by Kempe chains is established by quantitative methods. Deeper in- 

Sig This paper includes a conjecture to which Birkhoff attached the 
maximum importance. The map of the sphere under consideration 
consists of n +3 simply-connected regions containing only triple ver¬ 
tices, that is, points on the boundaries of just three regions. Let 
p ^ be the number of ways of coloring this map in x colors. It is 
known that this polynomial is of degree »+3 and vanishes when x = 0, 


1 and 2. Hence 


«*) = 




x (y - i)(y - 2 ) 

is a polynomial of degree n in x. The conjecture is that 


<*-3)-S «*)£(*.- 2)" 


for x ^ 4 and n>0. The conjecture is readily established for x^S. It 
is also established forn<9 even when x*=4. It has been verified for 
special maps of 12, 14, 15, 16, and 17 regions with x =4. 

This paper also contains an asymptotic formula for P„(x) conjec¬ 
tured by Birkhoff as a result of heuristic probability considerations. 

(c) General mean value and remainder theorems. The paper [50 ] of 
Birkhoff written in 1904 is somewhat complicated in form as is per¬ 
haps necessary. There is given the value of /(x) and some of its 
derivatives up to the (n—l)st at n points on an interval (a, b). 
Birkhoff introduces a polynomial F which together with its deriva¬ 
tives equals/and its derivatives at the respective points in question. 
The function / ( " -1) (x) is assumed continuous, and f {n) is assumed to 
exist on (a, b ). The general remainder theorem involves/ and F and 
includes most known remainder theorems. It has applications in the 
fields of mechanical differentiation and quadratures. 

(d) Aesthetic measure. Birkhoff’s interest in the analysis of art and 
music forms was of long standing. It culminated in his famous treatise 
on aesthetic measure [51 ]. To oversimplify this work of Birkhoff 
would result in a misrepresentation. For this reason and because of 
the readability of [51 ] no extended account of Birkhoff s work in this 
field will be presented. 

Within each class of aesthetic objects Birkhoff defines the order 0 
and complexity C so that their ratio 

O 

M = — 

C 
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yields the “aesthetic measure” of any object of the class. Equal at 
least in importance to the particular value of M attached to an aes¬ 
thetic object is the fact that such an attempt at aesthetic analysis 
leads to distinctions and comparison of a more refined nature than 
those ordinarily made. His objects of study include polygonal forms, 
ornaments and tilings, vases, melody and harmony, and musical qual¬ 
ity in poetry. Birkhoff’s work is in the spirit of many a great artist and 
artisan of bygone days. The writer recalls the impression made upon 
him by finding Sheraton, the great creator of 18th century furniture, 
starting a treatise on design with descriptive geometry and ending 
with sideboards. 

In addition to this principal work Birkhoff wrote a number of 
papers on the subject, and spent a half year on leave of absence in 
the Far East and Europe to gather material. His study has been of 
interest to artists, musicians, psychologists, historians, and even 
mathematicians. The writer has found Birkhoff’s theories on melody 
of practical value in the problem of effective improvisation on the 
organ. The high value which Birkhoff put on the aesthetic is illus¬ 
trated by the answer he gave a professional musician as to why one 
should study mathematics. “One should study mathematics,” said 
Birkhoff, “because it is only through mathematics that Nature can 
be conceived in harmonious form.” 

(f) Basic geometry. Birkhoff and Ralph Beatley joined forces in the 
interest of the teaching of elementary geometry around 1929. In 1932 
Birkhoff [52] wrote a set of postulates for plane geometry based on 
the scale and protractor. These studies were capped by a textbook 
[53] on Basic geometry , written with Beatley and published in 1940. 
The necessity of undefined terms and assumptions is emphasized from 
the outset, doing away with “self-evident truths.” There are five 
fundamental postulates, seven basic theorems and nineteen other the¬ 
orems together with seven in loci. The system of real numbers is in¬ 
corporated into three of the five axioms leading at once to the heart 
of geometry. Reports on the use of the book in practice are favorable. 
In any case the introduction of new ideas from the pen of a man as 
eminent as Birkhoff should be a great stimulus to the subject of ele¬ 
mentary geometry. 

In summarizing the mathematical work of Birkhoff a sentence of 
PoincarS comes to mind. Of the periodic orbits in dynamical systems 
he says that they are “la seule brfcehe par ou nous puissons essayer de 
p6n6trer dans une place jusqu’ici r€put6 inabordable.” PoincarS would 
be amazed to see the extent to which Birkhoff has widened this 
breech and opened many others. His zeal never flagged. 
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During the major part of his life Birkhoff was the acknowledged 
leader of American mathematics. His nomination as President of the 
International Congress of Mathematics that was to have been held 
at Cambridge, Massachusetts in 1940 was symbolic of this fact. 
Learned societies and universities the world over honored him. His 
work was crowned with four prizes. In the case of a man such as 
Birkhoff, a ranking at this time in the hierarchy of the great, in an¬ 
ticipation of the verdict of history, seems of doubtful value. The au¬ 
thor can find no words adequate to define him except that he was 
profoundly unique. Those who knew him best can add very simply 
that he was a kind and courageous friend for whom they will have a 
lasting affection. 
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ON THE ASYMPTOTIC CHARACTER OF THE SOLUTIONS OF 
CERTAIN LINEAR DIFFERENTIAL EQUATIONS 
CONTAINING A PARAMETER * 

BY 

GEORGE D. BIRKHOFF 

The wra of the present paper is to develop the asymptotic character of the 
solutions of linear differential equations of the form 

^ + + ... + p n a 0 (x % p)z - 0 

for large \p\. The functions a 4 (x, p) are analytic in the complex parameter p 
at p — oo and have derivatives of all orders in the real variable x. Schlesinger f 
has proved asymptotic properties for lim p — oo on some fixed ray arg p = a ; 
in this paper we prove similar properties for a region 6 = argp — yfr, but by a 
different method. In 1837 Liouville J treated the important special case 

%? + [f>’ + 9 , (*)]* = 0 

when p is real, the first problem of the kind to be considered; the method of 
attack used in this paper is of a similar nature. 

It is purposed to make an application of the results here obtained to boundary 
value and expansion problems in a second paper. 

I desire to make acknowledgment of the kind encouragement and valuable 
suggestions received from Professor E. H. Moore, for both papers. 

We consider functions z(x,p) of a real variable x on the closed interval 

•Part of a paper presented to the Society (Chicago), Mar. 30, 1907, under a different title. 
Received for publication October 23, 1907. 

fMathematisohe Annalen, vol. 63 (1907), pp. 277-300. This paper appeared after the 
writing of the present paper. See also a paper by J. Horn, Mathematisohe Annalen, vol. 
62 (1899), pp. 271-292. 

{Liouville’s Journal, vol. 2, p. 16. Cf. in particular $3. 
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(a, b) and of a complex parameter p, |p| > /?, which satisfy a linear differ¬ 
ential equation of the form 

<!) + + — + fa,(x, p)z = 0. 

Here we assume that the functions a,(x, p) are bounded by the inequalities 

(2) \a t (x 9 p)\2M |p|>A), 

and that 

(3) «<(*./»)- 

The coefficients a 0 of this last series we assume to be continuous with continuous 
derivatives of all orders on (a, b)\ but not necessarily real. As a consequence 
of (2) and (3) the a.(x, p) are continuous in x and p, for the series (3) are series 
of continuous functions of x and p converging uniformly for \p\ ^ R 0 > R. 
Further we postulate that the n roots 

»■(*), »,(*>, •••, «..(*) 

of the equation 

(4) «*■ + + • • • + “»(*) - 0 

are distinct for every x on (a, 6). 

By R(p-) we denote the real part of the complex number /*. 

Definition. By a region S of the p-plane we understand a region for 
which the indices 1 to n can be so arranged that 

(6) £ -»[/>«>,(*)] £ ... £ 

for every x on (a, 6 ) and every p on S. 

If p = p 0 is an S point, so that (6) obtains for p = p 0 , it is clear that (6) 
holds if only arg p = arg p 0 . By virtue of this fact the half line arg p = arg p Q 
belongs to the same region S. For a given x the relations (5) define a certain 
closed sector, 

*, s "g /> = *,. 

containing the half line argp = argj> # . The largest closed sector common to 
all these sectors, 

(6) ASargpS*. 

forms the region S y degenerating to a half line when 6 = yfr. The funda¬ 
mental theorem refers to the solutions of (1) on these regions S , when such 

regions, exist. 
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An important set of inequalities on a region S is 


Wo* 


(!) V* 

In an expression of the form 





*** [«.(»>»■* + «i(*)r*- , + —] 

the term 

shall be called the principal term. 

Finally we introduce the notation 

so that (1) becomes 

(8) A (*) - * w + <*„_,(*, p)t lm ~ l] + • • • + o 0 (*» />)* = 0 ; 

and also the notation as a generic notation for functions of p and other vari¬ 
ables bounded for \p \ « when It 0 is sufficiently large. ' 

Lemma I. For every value of i from 1 to n inclusive there exist an infinite 
number of functions u M (z), u„(x), • • •, continuous and with continuous deriv¬ 
atives of all orders , such that u m (x) does not vanish at any point of (a,b) 
and such that if the functions 

/ \ '/a'*'** / X , 

Jm9 


be substituted for z in the expression A(z) t the coefficient of 




\9 -0, 1, •••, m/ 


in the expression thus obtained vanishes identically. 

To prove this, write 

/ \ . . 

u(z,p) = e- / * u(z,p), 

where w(x) is some definite one of the n roots to.(z). We find then 
«'"(*. P)=^‘ [>(*)®(x, p) + ®">(*. />)] , 

« m (*[»(*)]*+ ^ w(x)jv(x, p)+ 2w(x )*"!(*, p) + v m (x, p)J, 
and, in general, 

(9) %W(xtp)me f S m p)v w (x, p) t 
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in which a >4 (x, p) is a polynomial in 1/p of degree j — 1 at most, whose 
coefficients are functions of x continuous on (a, b) and with continuous deriva¬ 
tives of all orders. We note that the principal term in the coefficient of v(x, p) 
in (9) is 

(10) “*.(*)= (>(*)]'• 
and the principal term in the coefficient of t> l,, (x, p) is 

(11) ■»„.(*) -■/IX*)]'"; 

also we note that 

«*(**?) — *• 

Hence we conclude 

(12) A[u(x,p)]meS‘"' > “j[v(x,p)] 
where 

(13) A (z) m »N + p)zl—'1 + • • • + a 9 (x, p)z , 

the Uj(x, p) being conditioned as the a y (z, p) are in equations (2) and (3). 
For convenience we write 

“-(*• #») — !. a,(*.*>)-» (»>«)• 

We see from (10), (11) that if we place 

then 

(14) a M (x) - [»(*)]* + «_,.,(*) [«.(*)]- + • • • + a M ( x ) = 0 
and 

(15) o„,(x) - n [«>(*)]-' + (" — 1 ) a .-,. t (*Jt* 0 (*)]"“' + ■ ■ ■ + «,„(*), 

so that a l0 (x) =$= 0 on (a, 6), the n roots to { (x) being distinct. 

If then we write in (12) 

■•-I 

«(*.P) = Z“y(*)/> _/ . 

JmO 

we find the condition that the coefficient of 


in A [u(x, p)] vanishes to be 




(» = 0 , 1 ,•••,*) 


(16) 


d‘ 


Z “«(*) £>“»(*)=°- 




The equation (16) is true for s = 0 by (14). For s 4= 0 we can write (16) in 
the form 
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gince if in (1®) ^ = s » fc ^ en f = j = 0 and the term corresponding to this set 
of values has a coefficient d M (x) = 0; if k = t — 1, we have either 2—1, 
*aa0or2«»0*i“l* an< f fc be corresponding terms are the first terms of (17). 

It appears then that *,_,(») can be determined in terras of 
u (*), u,_,(x)» • u 0 (x) as a solution of a certain linear differential equar 
tion of the first order which has no singular points on (a, 6) since a l0 (x) =j= 0. 
Thus u 0 (x)> u,(x), u ~-i( x ) are obtained in succession from (16) for 

s ■* 1, 2, • * •* m * For eac * 1 w t( x ) we obtain in this way a sequence of func¬ 
tions u i0 (x) t ««(*), • • •, such that if the expressions 

«i(*. p) = ^ 


be substituted for z in J (z), the coefficient of 


(< = 1 , 2 , 


0, 1, •••, m) 


vanishes by (17) since the conditions (16) are now satisfied for s = 1,2, • • •, m . 
Furthermore the differential equation for tt A (s) is homogeneous, so that by 
taking for u^fs) a solution which is different from zero at one point of 
(a, b ), we are sure that tf A (x) does not vanish at any point of (a, b ). 
Since the a<,(x) were continuous with all of their derivatives, the functions 
u i0 (x), u<,(*),••• are also continuous with all of their derivatives. The sequence 
of functions tf 0 (z) v u n (x), ••• has then the properties stated in the lemma. 

If the formal developments 

,/.•«>>"£«,(*)*>-' (< = 1 , 2 . •■•,») 

converged and admitted of n-fold term by term differentiation, we should have 
in them the asymptotic developments of solutions of our differential equation 
which we desire. This however will not in general be the case. We shall, 
nevertheless, show in the theorem below that by breaking off after m terms of 
these series (i. e., by retaining the part u.(x, p)) and putting in certain remain¬ 
der terms we can get true solutions of our differential equation. In order to 
prove this we must first prove another lemma in which the form of the dif¬ 
ferential equation is established which is satisfied by the sums of these m terms 
(i. e., u.(x % p)) without any remainder: 

Lemma II. The homogeneous linear differential equation of order n with 
n solutions u t (x, p) has the form 

B(z) m *M + 6_,(*> + ••• + J 0 (x, p)z = 0, 

where for | p | > R° 

(18) = &.(*, p) = Y.b (x)p-‘ (*' = 0 , 1 , ■■ , n — 1 ). 

1=0 
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The coefficients bj(x) which appear here are continuous together with all their 
derivatives , and 

(19) (» = 0,1, •••, » — l;i = o, 1, •••, m). 

The homogeneous linear differential equation of order n with solutions u t (x % p) is 
*W zl—'l 


( 20 ) 


\v[ n] ( x 'P) ul— 11 (*,/>) •• 

For the elements of this determinant we have 


M X >P) 


0. 


ti|0(x,p) 


~'f\ {x)p - 

*-« 


( 21 ) 

where by (10) 

(22) M*)” [ w <(*)] /m «(*) 

Thus if we factor out of (20) 

ii**~ 


<■1 


the differential equation takes the form 

(23) £.(*, ?)■>*> + /8_,(*, /»)»*— 11 + ■■■ + P 0 (x,p)z = 0, 

where £,(*, p) are polynomials in 1/p. We have for the principal term of 


*-(•> - 




• • • X 100 

• • 

X * 1-1 • 

• • 

- 

• • 

•—1, • 

K.-x. 

• I 
/< 


In view of (22) this last determinant may be written 



[«-,(*)]- [».(*)]- ••• “.(=') 1 

’n 

[«,(*)]-' [«>,(*)]-* ••• «,(*) 1 

n «»(*) 


<■1 



[“’.( z )]" 1 [“’.(*)]"’ “.( z ) 1 
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This is not zero at any point on (a, b) as the m t (x) are distinct and the 
u (x) 4= 0 by Lemma I. Therefore for |/>| = It 0 


PS x *p) 






But we can write (23) in the form 

(24) * w + + K(x, P) z — °* 

Now the functions b iJ (x) and their derivatives are continuous since the v) { (x) 
and u t (x) are of this character. The first part of the lemma is thus proved. 
Now let j 0 be the smallest value of j for which, for some i and x, 

M*) + «*(*)• 

From (21) and (22) we see that the principal term of 

(26) B[u,(x,p)]-A [«.,(*, p)] m S — A [«,(*, f>)] 

IS 

(26) />-*[£ [*•>.(*)-<«./.(*)][»,(*)]*]“•(*)• 

Assume j 0 — m if possible. In each part of the difference (25) the coefficient of 

p -i. 

must then vanish, in the first part since u.(x, p) are solutions of B(z) =* 0; 
in the second part, by Lemma I. Therefore from (26) 


Z [&./,(*)-«.,.(*)] [«>,(*)]*- 0 (•'=>.2. 


We conclude that 


*bO 


the w>.(x) being distinct. This is a contradiction. Hence j 0 > m. 
Theorem. On a region S there exist n independent solutions 


of 


d n z d m ~ l z 

+ P) fai-t + ••■ + P'<* 0 (x, p)z= 0 


analytic in p such that if the integer m is chosen at pleasure and p it on S y 
2 i(*» P) = «.(*» P) + E 0 p~~, 
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(27) 

where 




d — 1 , v d n ~ l . . , co* t? n -m+«-\ 

E -' p 


and u a (x) does not vanish at any point of (a, h). 

Proof We proceed to effect a comparison of the solutions of the given dif¬ 
ferential equation with the solutions of the differential equation of Lemma II. 
First we write the given differential equation in the form 

(28) + *>+■■■ +P m ’>,(x,p)z = p'B-A(t). 

The development of the coefficient of zl*l in B — A (z) begins with a term in 
p-»-\ in view of (18) and (19). so that 

(29) \ b „( x ' P) - a *( x » *01 “ D '\p\~ m ~' 

for \p\±K°- 

The general solution of a non-homogeneous linear differential equation 


may be written* in terms of a set of linearly independent solutions 
y t (x), y,(x), ■ ■ •, y„(*) of the reduced equation in the form 

(31) y = i>,?,(*) +J |^£y,(*)*,({)]■#■({)#• 

where the functions z,(x), t,(x), •••, a.(*)are determined from the conditions 


t(£ y ‘ {x) h 



(1 = 0, 1, •••, " —2), 
(/=n-J). 


while e l% c,. • • *, c„ are arbitrary constants. Any y and <f> satisfying (30) also 
satisfy (31) for some choice of c,, c,, •••, c„; and if y and <f> satisfy (31) for 
some c,,c t , ••*, c„, they also satisfy (30); hence (30) and (81) are entirely 
equivalent. 

«Cf., for instance, Schlksinoeb : Handbuch der Thtorie der linearen DifferentialgUidiungen , 
vol. 1, p. 78. 
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Thus treating (28) as a non-homogeneous equation we find an equivalent 
equation 

(82) *(*,*>) = !)«<“<(*• P)+j[ [£“<(*• P)\p' B - A l z (*< 

where the t>/s are obtained from the equations 

f m/ x / v 1° (1*0,1,—,n—2), 

If in place of these functions v t we introduce functions u t defined by the equation 

p)=p-"'*>(.*. p). 

equation (82) becomes 

( 88 ) z(x,p) = 'Ec l u l (x,p)+p£[i2uAx,p)u i {,{,i>)^B-A[z(i,f>)]dt, 
where the u i are determined from the equations 

(84) g*JV, ,)*(«,,)-(J < ( I i=--l)V’ B-3) ' 

In order to prove the existence of a solution z k (x % p)o f the character stated 
in the theorem for some definite k we make the final transformation of the 
constants c t in (83), 

c 4 »c; (1*1,S,—,ft), 

(36) e, ~~ p «-*+!.*+*,•••.»)• 

This transformation is reversible. The given differential equation thus appears 
finally in the form of an equivalent integral equation 


(36) 


'(*. p) = p) + />)«<({. p)^B-A[z( f, p)]d( 

+ pjf [£«.{*.p)Jb^T2[*({, p)\di, 

that is, the solutions of (36) and A (z) = 0 are the same. In the form (33) we 
could infer one and but one solution z(x, p) for a given set of c,, c,, • • •, c n ; in 
fact, that solution of A(z) = 0 which satisfies the conditions 

c V " d J 

= p) (>=°. I, -, n-1), 

but a similar inference is not possible for (36). 
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W it© 

£«<'>(*. />) = «' y< “"">*.(*• f: P)* 

t=l 

✓ Q*TX 

± «| q («. <»)«,(«. P) = P)- 

<-*+! 

For the consideration of (36) it is fundamental to establish that the functions 
<f> kl and yfr u are bounded in the following manner for |p| ^ R°: 

“ \K(:t:P)\*T. •*«*•*» 

(88) |^(*,f;p)|sr, azxztsb 

To prove this we note first that from (21) 

(39) wS ,J (x, p)=* ^ *%«(*» p), 

in which *„(*, p) denotes a polynomial in 1/p. If now we substitute in (34) 

(40) *,(*, p) - W0 %(*» Ph 

we find for the determination of $,(*, p) the n linear equations 

. 1° (1 = 0,1, •••» n—2), 

(41) 2>«(*. P)M*» P) * 1 1 (I = n — 1). 

tal 1 

The determinant of these equations is a polynomial in 1/P whioh we ca JJ® d 
^(x, p), and has the principal term £*(*) [see (23)] which is not zero. We 

conclude that for |p| S K* 

(42) K(*.p)| - n, p)I £ *• 

Now from (89) and (40) we have 

(43) |ui 0 (*.P)« i ({.P)l S l*'' / * ’'“I* 

whence m 

(44) \v\‘ ] ( x * P)*M' P)\ -l e# 1 
if we make the restriction 

(• = 1, 2, •**, 4), £ = * (< = 4+l,4 + 2,-.-,») 

To see this one recalls the inequalities (7). Hence 

i»l * 1 

() I ± U i ,, (*,p)fi,(f.P)U('*-*)|e'- / ‘'' ,<,> "v| (*»•). 

I<=*+! I ■ 1 

which are in effect inequalities (38). 
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We now consider that solution z h (x> p) of (36) for which 



(< + *). 

(»' = *), 


and we will show 

(a) that for |p| = R' one and but one such solution exists, and this is 
analytic in p ; 

(b) that the solutions z,, z 2 , •••, z m thus defined fulfill the relations (27) of 
our fundamental theorem. 

The linear independence of z,, z 2 , • • -, z m is then an immediate consequence of 
their form (27), since w,, u 2 , •••,«, are linearly independent. The demonstra¬ 
tion is thus completed. 


Proof of (a). 

We know there exists one and but one solution of (33) for all sets of values 
of c r To each set corresponds a definite transformed set c\ which we will show 
has the form 


where the y^(p) are analytic in p. To prove this statement we define 
Z k (x,p) t (£ —1, 2, *) v to be that solution of -4(z) = 0 which satisfies 

the equations 

d 1 _ d J 

faj z *( a ' P) - ~d£ u k( a ' P ) •••» n - 1) - 


If z is the solution of (32) for the set c,, c 2 , • • • , c h , we have then 

The Z k (x t p) are analytic in p since the coefficients in A(z) = 0 and the 
u { (x % p) are. If this value of z is substituted in (35), we obtain the transfor¬ 
mation in the stated form. 

From this we see that either a unique solution for the set c\ of (46) exists, in 
which case this solution will be analytic in p, or there is a solution of (36) for 
c i = 0 (i^li 2, •••, n). If then we prove that the latter alternative is 
impossible for |p| > R y we shall have proved statement (a). 

Let us now write down (36) and the equations obtained from it by differ¬ 
entiation, using the definitions (37). We then obtain 

* in (*,p) = £c:uin(x,p) + p r P )B^A [z({, p)]df 

i-1 J . 

+ p f; p)B^Ta [.«. p)] df 

(l = 0,l,..,n-l). 
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These are a set of linear integral equations in z(x, p), zl'\x, p), • • •, z^~ x \x, p) 
of the Fredholm type. 

Assume if possible a solution 2 ( 1 , p) to exist for c\ = 0 (i = 1, 2, • • • , n). 
If we write 

*!')(*, p) = p) (1 = 0, 1, »-l), 

the equations (47) become 

(48) * f (a>,/>) = + p f**(*.£; p)y[*(f»p)] rf f 

(1*0,1, • n—1) 


where 


y[i(x, p)] a £ [6/*, p) - «y(*. />)]*,(*. (>)• 


(1 = 0, 1, •••» n 1) 


(49) 

If W 4= 0 be the maximum of 

p)i 

ou (a, 6), we conclude from (29) that for |p| = R° 

(50) I? [*({• P)] I “ nD-W \p\~ m ~'. 

But in one of the equations (48), e. g., / » |z,(x, p)\ has the value W at 

x =» x x , so that 

(61) TK=|pj *„,(*,, {i p)?Wf.p)]<if+pJ* p)?[»(f.p)]^ 

Applying to (51) inequalities (50) and (38), as is possible since £ — *, in 
♦«,(*,, f; P) of (38) £ Sx, and in *„,(*,, p), we find 

(52) IF = n(6 — a)T • D - W \p\- 

which is not possible for 

\p\± m Vn(b - a)T - A 

Thus the set of values c\ = 0 is seen to be impossible for |p| = R. Hence a 
unique solution for (46) exists. 

Proof of (6). 

By (46) we have 




(53) 

Writing then 


(*,p)-“L n (*.p)+pJ^[S“ , i ' , (*.p)« i (f.p)]-B-^[**(f>p)] rf f 

+ p f [ t ■?*(*. p)“i(f > p)l B--A.[ z k(i * p)]^f • 
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(54) 


*«(*. P) = V H (x,p) + p£4>Jjc, f; P)g[x t (i, p)2dt 


pf ’#%,(*, f ip)9[* t (i, />)]<*f. 


where *„(*, />) and j(*) are defined as in (39) and (49). This equation (54) 
is analogous to (48). 

Let W denote the maximum of 

!*«(*- #01 (1 = 0, 1, ...,n-l) 

on (a, ft). Then we find in analogy with (50) that for \p \ ^ ft 0 

(56) \9[*s,(*,p)]\2nD-W \p \—*. 

If this maximum be attained for f - f„ * - x„ we find in analogy with (52) 

W - | + n(5 - a)T D W\p\~, 

whence, if m > 0 and |/>| is large, 

(66) JTSQ, 

since 

But from (54) 

!**'(*• p )-*«(*• p )I — JpjT *»(*. f; p)g[*k(t , p)]d( 

+pf+*(*'eip)*fo(e.p)w !• 

Therefore, using (38) and the inequality 

IS r [*.(*.f)]|£nZ)-Q| # ,|—>, 

which is a consequence of (56), we see that for a large enough |p| 

l**( x ’P) ~V u (x, p)l £ n(b-a)T D Q\p\—. 

Recalling that E is a generic notation for functions of p and other variables 
bounded for large \p\, we conclude from this at once that 

*.(*. P)=- «,(*, p) + e’S-'^E, p-, 

&**(*, p) + 


£**<*• <■)=£«.(*. />) + 
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BOUNDARY VALUE AND EXPANSION PROBLEMS OF ORDINARY 
LINEAR DIFFERENTIAL EQUATIONS' 

BY 

GEORGE D. BIRKHOFF 
Introduction. 

Let p 2 (x ), ;> 3 (x), • ••, j\(x) be functions of the real variable x on the 
closed interval (a, 6), which are continuous with their derivatives of all orders. 
Write 

•£(*)■&;+ • +/ , i( x ) *•* + / , -( x ) 2 » 

W j.. i 

Jr} + • +(“ 1 r*^ [/»,(*)*]+ —+/».(■)*• 

With the linear differential equation of the nth order in u 

(2) £(u) + Xu-0t 

and n linear homogeneous conditions in w(a), u(a), •••! w < "~ ,, (a), u(6), 

«'(6).«<—*>(*), 

( 3) ir,(«) = o, ir,(«)-=o, w.(«)-o 

we associate the like adjoint differential equation 

(4) jJ/(v) + Xi> = 0 
and n like adjoint conditions 

(5) F,(t>)-0, F 2 (r)-0, V m (v) = 0. 

For certain characteristic values of the complex parameter X there will exist 

•The second part of a paper presented to the Society (Chicago), March 30, 1907, under a 
different title. The first part of this paper has been printed on pages 219-231 of this volume. 
Received for publication May 12, 1908. 

t It is not an essential generalization to write instead of (2) 

if />,> + 0, i* + 0, and pJP ie real. We therefore restrict (2) as stated. Max Mason has 
treated a special case n = 2 with the restriction P+0 removed, by different methods: these 
Transactions, vol. 8 (1907), p. 427. 

Trana. Am. Math. Soc. 25 373 
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a solution* u(x) of (2), (3) or v(x) of (4), (5). These values are the same in 
both cases: let them be 

\* 

with corresponding solutions 

w.(ar), *,(*), •••, 

*.(*)* «,(*)* 

These functions w.(x), v.(x) are such that 


jf «.(*)•>(*)*-o 


(i+i) 


This property leads us to the formal expansion of a given function^*) on (a, 6), 

« T/(*)».■(*)** 

(7) /(*)~Lts -“<(*)• 


How are the characteristic values distributed in the X-plane? What is the 
nature of the solutions m.(x), tv(x)? In what sense does the expansion repre¬ 
sent/(x)? These are the questions considered in this paper. 

We begin with the derivation of the formal properties of the boundary value 
problem (§1), and of the expression for the sum of n terms of (7) by means of a 
contour integral (§ 2). 

There follows the more intimate study which is based on certain facts concerning 
the asymptotic nature of the solutions of (2) and of (4) when | X | is large. These 
facts are derived as an application of ray paper in a preceding number of these 
Transactions (§3). The distribution of the numbers X. and the nature of 
the expansion is then obtained (§4 and § 5). Finally the contour integral is 
evaluated and the representation theorem proved (§ 6). The expansion is found 
to behave like a Fourier series except in the vicinity of x = a and x = b. 

Liouville was the first to introduce the notion of adjoint conditions in a special 
case f and to consider the related expansion. The results of the present paper are 
known for the real self-adjoint case n = 2. J Westfall has proved a repre¬ 
sentation theorem for the real self-adjoint case n = 2m, providing that f(x) 
and its first n derivatives are continuous. § 


* By a solution we mean always a solution not identically zero. 

t Liou ville’s Journal, ser. 1, vol. 3 (1838), p. 561. Professor E. H. MooBE suggested 
to me the possibility of generalization. 

t A. Kneseb, Mathematische Annalen, vol. 58 (1901), p. 81 ; Dixon, Proceedings 
of the London Mathematical Society, ser. 2, vol. 3 (1905), p. 83. 

\ W. D. A. Westfall, Zur Theorie der Inlegralgleiehungen, A. MYLLEB, Geicohnliche Diffe¬ 
rential ghichungen hoherer Ordnung in ihrer Btzirhung zu den Inlegralgleiehungen. Gottingen 
dissertations (1905 and 1906). Reference should also be made to Hilbert’s antecedent papers on 
integral equations in the Gottinger Nachricbten for 1904. 
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§ 1. The formal nature of the boundary value problem. 

We must first make precise the adjoint conditions referred to. 

Definition. Let Tl'j(w), • ••, ir„(w) be n given linear forms in t/(</), 
u '(a), m—>•(«), w(6), u(b). •••, tt (m ~ h (b) and ir*(n) 

be any n further linear forms so chosen that W l% • ••, W u are linearly inde¬ 
pendent. Then in the identity 

(8) f zL(y)dx = P(y,z) + (*yJ/(z)dx, m 

where P(y> *)\'Zm * s a bilinear form in y(o),y'(a), •••. y'"" 1 («), 
y'(5)> •••, and z(a) % *(«). •••. *(&)» *• » 

we can write 

(9) P(|F. »)!*"*-ZlF,(ir)F^(.) t 

#■« 1=1 

in which V t (z) are linear in s(«), *'(<?), •• z(b),z(b), • ■ - % z"- l> (b) 

and linearly independent. Then (i = 1, 2, •••,«) are the adjoint 

conditions to the given conditions ]!'(»<) = 0 (i = 1. 2, n). Any set of 

conditions ll^(w) = 0 (» = 1, 2, • • •, n) equivalent to IP^u)■0(/»l, 2, • • •» n) 
by linear combination, with any choice of ll’’ (l<f ,(u), • ••, H^(m) leads to a set 
of adjoint conditions Pj(e) = 0 (i— 1, 2, . n) equivalent to K 4 (v) — 0 
(ial, 2, •••,«) by linear combination. 

Conversely, given K,(u)* 0(i= 1, 2, •••, n) we choose V^ l% •••, V u as 
above and find TF 4 (u) — 0(» = 1, 2,---, n) to be the adjoint conditions. 
Hence if the problem (4), (5) is adjoint to (2), (3), so also is (2), (3) adjoint to 

(4) . (5). 

The properties which this section proves are stated in I—III. 

I. If for X a X’ a solution u"(x) of (2), (3) exists , a solution v*(x) of ( 4), 

(5) will also exist for X = X’; if u\x) is unique (except for a constant factor), 
V*(x) is also unique (except for a constant factor). 

Let u * be the given solution of (2), (3) for X= X’. Then we have 

W'.OO- IF,(“')-TT.(u) = 0, 

and for some j 
as otherwise we should infer 

u'(a) » ii - '(a)» • •. = «•<-»( a)=u(b) = u’(b) = •■■ = u <-*>(6) = 0. 

In the n-fold linear spread of solutions v(x) of (4) at X = X* there will be at 
least one, say v’(*)i which satisfies the n — 1 linear homogeneous conditions 

_ K-i (*••■)=° ('+»• 

*Cf. ScilLKSlNOER: Handbuch der Thtorie der linraren Differentialglrichungtn, vol. 1, p. 64, 

formula ( 1 ). 
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Substitute in (8) y = u\z = v and we obtain 

_ j'vudx = IF„ t> ( .<•) - X* £\-vdx. 

Therefore V n _j(v) = 0 also, and v’ will satisfy (4), (5). 

If u is unique, v * is also unique. To prove this assume if possible that 
there were two linearly independent solutions v and t>" satisfying (4), (5) when 
\ = X*, while u is unique. Then we could choose i and j so that 

^Lo; 
f„ +i (o wol 

otherwise we should have constants c, d , not both zero, such that 

+ dv~) = 0 •••,»), 

and v = cv* -f dv~ would fulfill the 2 n conditions 

which is not possible since the V x , • • • , V u are linearly independent. Choose 
now m” linearly independent of u to fulfill the n — 2 conditions 

U' l (u") = O t (/ + n - » or n —j) 

where / ■ 1, 2, •••, n. Writing in (8) y = u H , z = v' and y = u\ z v **, we 

obtain, on simplifying, 

+ tf _,(«-) v,j(V) = o, 

+ IF^(u-) F. W («T) = 0. 

From this we deduce that also 

Therefore u” would satisfy (2), (3). This is impossible. Hence if u is unique, 
v m is also unique, which we were to show. 

Definition. If X = X' is a characteristic value of X for which one and but 
one solution of (2), (3) and (4), (5) exists, X* is said to be a simple characteristic 
value. 

II. Ify l% y 2 y are n linearly independent solutions of (2) at X = X*, 

the condition that X* is a characteristic value is that the determinant 


IF,(y,) 1F,(y,) ... lF,(y„) 

A= W t (y,) IF^y,) TF,(yJ 

*F.(y,) TF.(y,) ... TF,(y„) 
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vanishes; the condition that X' is a simple characteristic value is that not all 
the first minors vanish. 

The general solution of (2) is 


“ = c,y, + c, yj + ..- + cy¬ 
linder the condition A = 0 at least one determination of c,, c t , • • •, c„ is possible 
such that 

tf,(u) = o, ir,(u) = o,ir„(u)-o. 

This determination is unique when not all the first minors of A vanish. 

III. If ufix) and vfix) belong to distinct characteristic numbers X. and X y , 
then is 

u.Vjdx = 0 . 



For write y = i/ jf z = in (8). Because of (9) we deduce 
- X,jf = - \j£y.Vjdx, 


whence the desired relation follows. 

In important particular cases the problem (4), (5) may be precisely (2), (3) 
with v in place of u. We then say that the problem (2), (3) is self-adjoint. 
Or (4). (5) may be precisely (2), (3) with v in place of tc, except that M( t») is 
the negative of L( v). We then say that the problem (2), (3) is anti-self adjoint. 


§ 2. Expansion as contour integral. 

It is easy to prove that when X is not a characteristic number there exists a 
unique G(x y s; X)* such that the solution <f> of 

Z(*) + X* = », JF,(*)- ir,(*)- !?•„(*)-0, 

is given by 

<t> = ^ G(x, s; X) (o(s)ds; 

and likewise that there exists a unique 7/(i,s; X) such that the solution <f> of 


M(4>) + \<t> = <o, Vfi4>)= Vfi<f>)= 0, 


is given by 



X)o>(s)c/s. 


* Professor Bocher has dehned the Green’s function G tor an ordinary linear differential 
equation of order n under a special form of couditions, and has stated the principal properties in 
the Bulletin of the American Mathematical Sooiety, vol. 7 (1901), p. 297. 
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The explicit formula for G is 


I y.(*) y.(*) ••• y„(*) G(z,t;\) 
^.(y.) Jr,(y t ) ... ir,(y.) W t (0) 
w ,(y,) ••• w Ay.) w t {G) 


(10) <?(*,»,*)- A( X) " »^,(y.) lFi(y.) ■■■ TT,(y.) 


(- 1 ) 


. w t(yx) w,(y,) 


w,(y.) 


in which 


,^.(y.) K(y,) ••• K(k) 


I y,(z) y,(*) ••• y„(*) 

i!/.-*’(«) y?-”(0 ••• yf-"(«) 


k. .. _LA(i)_ y«(») 

(11) (?(*.«, x)-±|- jyp, (j) y; -., ( 7y 

yr*(«) y?"”(») 


y.(») 
y*.—’(») 

y';- 8 '(») 


i y,( 4 ) y,(«) ••• y.(») I 

( + if x> s, — if x < s). 

Here y,, y 2 , • • •, y, are any n linearly independent solutions of (2). There is a 
like formula for //(x, s; X.). Furthermore 

(r(x,s; X)=«//(s,x; X).* 

The function G is analytic in X (since y,, y 2 > • • •, y„ may be taken analytic 
in X) except for a possible pole when A(X) = 0, i. e. by II, § 1, when X is a 
characteristic value. 

]f\ = \. is a simple characteristic number for which G has a pole of the 
first order , the residue is 

f «,(*)«(*)& 

where 


Write 


Gr ( !e ’ ,iX ) = ?&? + '(*••! M. 


* For formulas like these and their proofs see Westfall’s dissertation, $$ 6, 7. 
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where «r(», s ; X) is analytic at X = X.. In view of (10) we have 

•«(*.*)- A'(\) • 

where *Vis continuous with its first n derivatives since the coefficient of G is zero. 
Further iV satisfies (2), (3) for all X so that It(x, s ) must be a solution of 
(2), (3) in x. Because of the relation between G and //, we infer also that 
R(x % s) is a solution of (4), (5) in s. By the definition of a simple character¬ 
istic value it follows that the residue is 

It remains to determine C t . 

By (12) we have 

lira [(X-\)<?(x, X)(*)»,(*)] =0. 

whence 


A-A« 


[i x - x,) X * G(x ’ * : x)u ‘( ,)ds - c < u ‘( x) - °* 


A-* 
But since 


£(«,) +Xu,-(X-X ( )«„ IT.K)- W'iK)- 

we have by the fundamental property of G 

(X-X,)J*<?(*,.; X)u ( (*)«f* = «,(*)• 

Substituting this value above we find 

lim [«,(*) - = °- 


ir„(.o-o 


Therefore 


out of which c 4 is determined; this proves that the residue has the stated form. 

If T be a contour in the X-plane which encloses we conclude by 

the above that 


(13) 


i c r ^ f > s(*) v <( x ) de 

Z=7=f I <?(*.*; x)/(.)*A-L4-«,(*)• 


In this proof certain points of logio are obviously slurred over. 


The J& K 

Acc. No. 


U-: 


varsity l, h 
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This is the sum of n terms of the formal expansion, providing that X,, \ 2 , • • •, \ 
are simple characteristic numbers at which G has a pole of the first order. At 
values X. for which these conditions are not satisfied, the corresponding term 
of the formal expansion (7) is to be replaced by 


r 


R,{x, >)f(t)ds 


where 7? (x, s) is the residue. Thus in all cases we have an expansion each 
term of which is uniquely determined. We omit the development of 7?.(x, s) 
in these more complicated cases. 


§ 3. The solutions of L(u) + \u = 0 and M(v) + Xv = 0 when |X| is large. 

As a preliminary to the deeper study of the problems which have been pre¬ 
sented, I apply the results of my paper printed on pages 219-231 of the present 
volume to the differential equations L(u) + \u = 0 and M(v) + Xv = 0. 

If we write X = p", the first of these differential equations can be written in 
the form there treated [see (1), loc. cit.], 

<Tu <7"-'u 

2*5 + p°n- 1 (*» + ••• + p)u = 0 y 

a .-i( x <p) = 0 ’ •••> a 0 (*,p)- 1 + &&. 

The coefficients obviously satisfy the restriction imposed, and the equation for 
w t (x) [see (4), loc. cit.], is 

(14) to: + 1 = 0. 

Thus to,, to 2 , • • •, are constants. We also find (proof of Lemma 1, loc. cit.) 

(15) *»(*)-! (* = 1,2, •••, n). 

In order to state in explicit form the theorem for this case, it remains to con¬ 
sider tue regions S (definition, p. 220). Inasmuch as w x , w 2 , • • •, w n are con- • 
stants we can for each p choose the indices 1 to n so that 

It(pw x ) ^ R(pio t ) = ••• ^ li(pw m ) 

for every x on (a, b). Hence every p is on some region S, 0 = arg p = yjr. 
At the bounding rays the ordering changes so that for some i and ; 
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Ii(ptc l )= 

An easy computation shows that the regions S are the regions 

(1=0, 1, 2n — 1). 


hr ^ <(/+ 1 )^ 

ar g p = 


( 16 ) n ” °' 

The differential equation M(v) + \v = 0 also is of the same type when we 
put X = p n . The roots io,(x) are 

(17) -w P -«<>,. •••. - to, 

and in the formal solutions we have 

( 18 ) * # (x)-l (i = l,2,...,n). 

The regions S are the same as for L ( u ) + Xu = 0. 

Let E be a generic notation for functions of p (and other variables), bounded 
when |p| is large. The application of the theorem referred to gives: 


On any region of the p-plane 

hr 


( 1S >) 


S: 


-- £ arg p S 


(< + !)' 

n 


( 20 ) 


there exist n independent solutions , 

y,. y,» —i y n of L(u) + p'u = o, 

of M(v) + P'v = 0 , 

analytic in p and such that on this region 

y,->,,(*■ 


( 21 ) 


??_ ^ _ u i x p) 4. *»».<»-«> — 


( 22 ) 


where 


E. 


z,= B ,.(x,p)+ 

/>) + «-'—*^-1. 


22 





G. D. birkhoff: expansion problems 


[October 


, w ti being the n roots of w* + 1 = 0, m any positive integer. 
§ 4. Distribution of the characteristic values. 


P 

9 a( x ) 


w,. w i* 


The condition that X" is a characteristic number is that the determinant A 
vanisires (II, §1)- The i-th element in the j-th column of A is I V^y.) where 
y , y,i are an ^ w linearl y independent solutions of L(u) -f u = 0. 

ii order to treat the equation A = 0 we take y. to be the y. of (20) § 3 where 
\ = />". In addition we assume the conditions W i to be normalized as follows. 
Reduce the number of conditions IF ( m ) = 0 of order n — 1 [i. e., containing 
either or w ( * ,-,) (6)] to a minimum, at most 2, by linear combination. 

Then, in those that remain reduce the number of order n — 2 to a minimum, 
at most 2, by linear combination. Continue in this way as long as conditions 
remain. The normalized conditions will have the form 

( 24 ) iF;(«)-irj«) + ir.(«)-o t 


irjH) = <.,«»•>(«) + X: 

W n (u) m W*(b)+ )£ Wlb), 




in which no three successive k's are equal. 

Definition. Let the to, be taken in every order such that for some p 4 = 0 

If n = 2 /i — 1 and always neither 0 O = 0 nor 0 X = 0, where 0 O and 0 X are defined 
by the identity 


(25) 0 o + 0 l s 


a, w\' • • • 


(«i+ «&)«£ 


• • • £,«»;« 

ajW 1 ? ••• 


(<** + «£ ,)<* 

&«£+i 

... 

> 

a n w h r ••• 


(«. + *£.)«£' 


• • • • 

••• /j.«* 


or if n = 2 n and always neither 0, = 0 nor 0, = 0, where 0 O , 0,, and 0 2 are 
defined by the identity 
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( 26 ) *, + *,. + £■ 

a x w\' • •• ajtojLi (*,+*£,)w*' 

a a w?*» ... a 2 u>*’_, (a t +s& t )to k J 

a.wl- («.+«&>£ 


( a . + !^,) w * +i ^ ,w “ v ’ ■■■ ^ |W "' 
(«.+ ;A.)x%i £Xv. ••• AX' 


the conditions TF,(w) = 0 (i = 1,2, • • •, n) are said to be regular.* 

We now proceed to prove the 

Theorem. If the conditions IF (m ) = 0 (i = 1,2, • • •, n) are regular , Me 
characteristic values of X /or (2), (3) and (4), (5) are in general simple \ and 
form a pair of infinite sequences X„, X llf (7 = 1,2,...) such that 


( 27 ) 




tc/iere * 7 ,^, * 7 ,,, r/re constants and | 2 ?,, j < J/ , £*,„ | < -I/. 

We consider the equation A = 0 in the p-plane, fixing attention on some region 
(19) for 7 = 7 0 . The transformation X = p" makes two adjacent regions S 
correspond to the entire X-plane. It is convenient to consider separately the 
cases w = 2/i — 1 and n = 2p. The starting point is the set of relations 
[deduced from (21)] 

w J.y,) = (/>»/)’' [ a < + E y + - 

(28) r' ei 

'W = J • 

# This apparently givea one condition for each region S, bnt these reduce to two for n =2/< — 1 
and to one when n =2 u. It is worthy of note that the Sturm*LI 0UVILLE boundary oonditions 
(n = 2) of the form 

Au(a) T l»'(a) = 0, Ju ( b ) + mu' ( b) =0, 
and also the periodic boundary conditions (n = 2) 

«(o) = «(ft). *'(n) = u'(t), 

are regular. An example of ffbn-regular conditions ( n = 2) is 

u(a)=0, «'(«)=**(»). 

f See § 1. 
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Case 1. »« 2/* — 1. 

By definition of the regions S we can choose indices 1 to n so that 

for the region (19) under consideration. Making this choice we find readily that 
on S 

/?(pu>.)<0 (* = 1. 2| •••» /* 1)» 

R (P w i) > 0 


(29) 
while 

(30) 


(< = /« +1, ^ + 2, • 


R(l>w,) = 0, ar gp = ^ l °^K . 


From (28) and (29) we see that in the formula 


^(y>)= ”'»(%)+ w »(y.) u= i. 2. i). 

the term can be absorbed into the term IK # () since e^ lb ~ n) is small 

of an infinite order in 1/p on that is, we can write 


(31) ^(V/)-(#>»/)*• [° i + L a | ^' + ^] tf=l. 1). 

By like reasoning we find 

(32) IW-^‘-w[A + |'y+J] o = „ + i, „ + 

Also from (28) 

(33) ^,(y.) = (/>«’.)*{<*.++ 

+ pu>. )*• [ e, + zj-£ ! +£]. 

If we substitute in A these values of W t (y } ) as given in (31), (32), (33), and 
remove the factors ft* from the i-th row (i = 1, 2, •••, n), and the factors 
f rom the j-th column (; = fi + 1, y. -f 2, .. -, n), i. e., altogether the 
non-zero factor 

fiA ri 

<«» i=»- rl 

the condition takes the form 


(34) 2 = (#. + | JS + f) + «'—•>(«, + g J + f‘) = 0, 

a= np*-. n c^-' a. 
i=l >=» + ! 
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The quantities 0 O and 0, are the 0 O and 0, of (25). In fact (25) was formed to 
give these terms. 

Since A was analytic in p, A will be also analytic in p. 

The expressions of the first minors of the /*-th column of A, 

b E 

a+ - P + - + r' 

cannot have all their first terms a = 0 since then 0 O = 0, = 0, contradicting 
the hypothesis. Hence for large values of |p| not all these first minors vanish. 
If we return to A, we see that accordingly not all its first minors vanish. The 
characteristic numbers when |X| is large must therefore be simple. This was 
part of the theorem. 

From (34) we obtain the equivalent condition 

-*(#»). *(»>)“ -i+'’£%+£ 

by solving for From this we infer 


(35) pn m (b — a) — log $(/>), 

log S(p) - log( - jj) + E £ + ~ ± 2lW-i, 

the final form of the condition, in which of course E is analytic in p on S. 

In view of this condition, it is obvious that the values of p on S which satisfy 
A a 0 have when | p | is large the form 

2M/31 lo s(~g°) yj h, E 

Pl ~ "•»(4 — a) + ®.(6 - a) + ftP + 1” 
where / is a large positive integer. Since by (30) 



arg o>„ - 


n ”2 


we see from (36) that the d= sign is to be so chosen that the p, approach asymp¬ 
totically (in an angular sense) the bisecting ray of S. Let now p describe a 
circle of fixed small radius r about 

wn 

w.(6-a) + u>„(6-a) 

This will be wholly within S when |p| is large; and 

arg [p«v(&-a)-log8(p)] 
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increases by 2 tt if the inequality 

I r 


■-i 


obtains as is true for |p| large. This proves the unique existence of p, when 
p\ is large , and the distribution of values p is determined. 

It remains to return to the X-plane; S becomes S' 

S': 1 0 tt = arg X =(f # -f- ljw. 

The values of X approach the positive or negative axis of imaginaries according 
as l 0 is even or odd, and by (36) have the form stated in the theorem where for 
some 0 O , 0 X 

n i°g(-£) 


9u or 9m = =*= 


2ttv/-1 * 


This completes the proof when n =» 2p — 1. . 

Case II. n = 2p. 

The proof is analogous to that for Case 1. if pm,, pw jy •••, pw n have 
increasing real parts on S we find 


while 


7?(pu».)<0 

R(pw i )>Q 


(i=M + 2,/^ + 3, •••, n), 


R(fw M )= Rif™** i)= °» arg p = either" or , 


and w M+1 = — . The region £ is of course defined by (19) as before. We 

now find Wfy f ) to be of the form (31) forj *= 1,2, • • p — 1 and of the form 
(32) for j = p+ 2,p+3 , • • •, n ; also for = p, p -f 1 we find 


(37) 


^(y. +1 )- (- /*>. p [«,+g ^1 ] 


+ «• 

writing w M+I = — w„. 

These expressions H r .( y> ) are substituted in A and a factor 


iia ri 


gPU/*-') 


is removed. The equation A = 0 becomes then of the form 
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( 88 ) + + + + + 

A=n a ri e" / *- , A, 

1.1 

0 O , ^i« being defined as in (26). The equation (38) is quadratic in e' u »'*~ a) . 

This equation (38) will yield the forms stated in the theorem for the charac¬ 
teristic values X„, X, M on S. They must lie near to one of the bounding rays, 
say argp = (J 0 + 1 )i r/n. If we consider the necessary form of the expansion 
on the other region S adjoining this ray, i. e., on the region 

n 6 r n 

we obtain the same equation (38) except for different E terms perhaps; hence 
the same necessary form for X M , X m as before. Thus we have the necessary 
form for the entire X-plane. 

The final fact of unique existence is demonstrated as before unless 

( 0 

„ or arg 7 , n = 

when the values p u> p lu may come indefinitely near to the bounding ray 
arg p = (f 0 + 1 Jw/n. By considering first the case 

!>.(*)-/>.(*)■- 

when the solutions y 4 are known, and by using continuity considerations, the 
fact of unique existence can be established for this case also. 

§ 5. Nature of the expansion. 

In this section we give a notion of the character of u 4 , v■ and of the expan¬ 
sion (7). 

Let us first develop the character of w„ in the case n = 2p — 1. Clearly 

( 39 ) Wn=c,y, + ••• + c ,y. 

where we have for the determination of c,, c,, •• • , c, the equations 

(40) «iW ? ;(y,)+e,w r ,(y,) + --- + «.w r j (y.)-o (.= 1 , 2 ,•••,»). 

Substitute in the matrix ||B r i (y / )|| the expressions given in (31), (32), (33) 
and remove the factor p** from the ith column (i=l,2 ,•••,/*); write for p 
the values (36). This matrix then takes the form 
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1 Pn 

P„ - 

Pu 


• • • e~*P Xn 

\ P » 

• 

P„ ... 

Pu 

• 

*2s±! 

• • • • 

... e'^Pu 

I 

|| 

P., 

P„ ■- 

p.. 

»/ •'li+l 

••• •'SpJ 


where the P„ are of the form 


, b E 

■+r+-+ P . 




The c { are proportional to the minors of any row, say the first. The minors 
of the elements 1 to m contain all the factors 

n «'*. 

besides a factor like (42), while the minors of the element i,i> contain all 
these except f 9 * 9 * besides a factor like (42). We conclude that 

c <“ G. (* = 1,2, •••,/<), 

(43) 

where Q t is of the form (42). **“' «=, + !,, + *, «), 

If we substitute into (21) the values (36), we obtain 


( 44 ) y< -*"5(S) rj 

where Y { is of the form 

(«) «+*£! + ... + * 


(. = 1 , 2 ,»), 


Placing the expressions for c. and y. of (43) and (44) in (39), we derive the 
form of «„; likewise the form of v u , i s obtained. We have finally 


”« = x:n) Uu + ( iz)u u + £ 

Ul “ = «" (S) U u , + £ ."StSJpr 

<=*+1 

<=M + 1 


<=M + l 
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f = 2/7r 1 + 2 7rl '- 1 - / " , r — 2/ir ,/-1 - W - V/l " 


n 


Here V are of the form (45) and n = — 1. 

When n = 2/* we get in the same manner 


(47). 


(■I 


III* 


10 


. + ±f 

/si le.'t 

<S| tantl 

r 11I+ .-o k,.. + /© r„. 

+ t e-C^V,,,, 

r-2/^-1 + , r-2iwy- 1 + . 

n n 


In both (46) and (47) it is to be noted that the 2-terms are not important 
except at x = a and x = 6, since the real parts of the exponential terms are 
large and negative for / large and a < x < b. We oinit the fuller discussion, 
to be made by (46) and (47). It is worthy of remark that the simplest case 

n-1. L(«)-u t W t (u)-u(a)-u(b) 

yields the Fourier series. The general expansion is clearly of a similar nature. 

§ 6. Convergence of expansion tof(x). 

We have expressed the sum of n terms of the expansion (7) in the form 

where T is some contour which contains X,, X a , •• •, \ but no other character¬ 
istic values within it; G is explicitly defined [see (10)] in terms of the solu¬ 
tions y. of L ( u ) + Xm = 0. By means of the known asymptotic character of 
the y. we determine the character of G , and then evaluate the contour integral 
as T enlarges without bound. 

For convenience let us use the notation [u>] for an expression 

a E 

n + “ +-1- Tit 

P P 

where w , a, • • • are independent of p but need not be constants. 

Trans. Am. Math. Soc. 26 
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Theorem. Let f(x) be made up of a finite number of pieces in the 
interval a = x = b, each real , continuous , and with a continuous derivative. 
Unless x = a or x = b , the expansion for f(x) connected with the differ¬ 
ential equation L(u) + \u = 0 and the regular boundary conditions 
JP.{u)= 0(i = 1,2, • ••,») converges to [/(* + 0) +f(x — 0)]/2. At 
x = a the series converges to 


and at x = b to 


<*./(<*+ °)+ <*,/(* “°) 
*,/(« + 0 ) + 6 ,/(*- 0 ), 


where a,, a 2 , b x , b t are constants independent of /(*).* 

Proof We restrict ourselves first to the case n = 2/x — 1. At the end we 
outline a similar proof for n = 2/m . 

Let the contour T be taken as a circle | X| = k in the X-plane. If the trans¬ 
formation X =» p n be made, we find 


'#(*»*; *-)f(*)d8dp + J* £np m ~'G(x, s; \)f(s)dsdp. 
In this expression 7 , is the segment of a circle j/>| = k' lying on a region S> 



v - M » p 


«. + *)* 

_ 1 » 


and y, is the segment of the same circle lying on the adjacent region S, 



n 


arg 


((. + »)« 

n 


We confine our attention to the partial integral 


(48) 




taking a<x<b. It is found to tend toward f(x — 0)/4. By considera¬ 
tions of symmetry the limiting values of the remaining partial integrals are 
determined. The first part of the theorem then follows. 

Let B be the bisection point of 7 , and A , C its end points. Take AB as that 
segment of 7 , for which 
(49) B(pw r ) < 0 

and BC as that segment of 7 , for which 


<M)_ *0*,)>o. 

* For de6nitions of regular conditions see §4. The restrictions imposed on/(*) might be 
lightened but only at the coat of brevity. 
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The indices 1 to n of to i are chosen as in (29) and (30). We must break up 
/**■* in two parts /Jy and /jy, both of which are attacked in the same way. 
To treat G on AB , write 

(51) 0(x,«; x) = ±$!>,(*)&(») { + 

where the y { of (11) have been taken as the y t of (21). Then y i is the coeffi¬ 
cient of r 4 in 


T l 

t . 

... T 

m 

m • m 


... *-*>(«) 

— V 

y.(«) 

y.(») 

y.(«) i 

[»PW 

rw 

- ’(«) j 

• • » 

> r*(«) 

• • 

... ^-»*(.), 

. 

y.(«) 

*(•) 

••• y.(») ! 


But by (21) 

(52) #•(•) 

If these expressions be substituted above we find 

(53) j^=i 

since — to./n is the coefficient of r i in the expression 


T l 

T * 

••• T „ 

to;- 

to;-* 

... 

l 

1 

... 1 

to;- 1 

to;-‘ 

... 

to;-* 

to;-* 

... to;-* 

. « 

1 

. 

1 

... i 


•The denominator contains the factors which we divide into the corresponding i-th 

oolnmns of the numerator. It is to be remembered that, if / + 0, 

nrffl- 
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£i^m.= 0 (i = 0,1, •••,* —2) X> r , m, = l. 

•-i <■* 

These show that m i has the stated value. 

By (51) 

G(x,s-, X)= + ljy i (i)y i (s) 

(■I 

since a < s < x for the partial integral I m A y. Also 


H'(ff) = - iZ w,(»j)U‘) + iZ r.WVA*)- 


From these equations it is clear that if we multiply the columns 1 to /a of 
s; X) in (10) by + ly,(j) (» — 1, 2, • /*). the columns p -f 1 to n 

by __ }?<(*) (‘ = M + 1, /* + 2, •••,»*) and add them to the last column, this 
last column has the elements 


+ !><(*)&(*)• E W u (y t )y (a) (<= 1 , 

1-1 J-l 

By (21), (28), and (53) these are of the forms 


(54) 




Hp i-I (E^e"'(-V'[<x,to} rt ']) (4=1. 3, »). 


In the new notation equations (21), (31), (32), (33), and (34) give also 


(55) 


Vi 




[i] 

"(J'] 

/>*'( [’,<] + 

y ««>*-»[ ,8, w*.] 


(i = l, 2, • ••,»), 

0 = 1 . 2 , •••,/>- 1 ), 

(>=/> + !. M- 2 , •••, »), 


m*)— ny n ([»,]+ 

<=1 j=*rl 

We introduce these expressions into (10) where the elements of the last column 
of N have been modified to the form (54). The factors of the denominator A (X) 
we can divide into the numerator as follows: the factor — 1 into the last column, 
the factor p k < into the (i + 1 )-th row, the factor einto the j-th column, the 
remaining factor A into the /*-th column. We thus obtain 
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(56) G(x y s; X) 


1 

np- x x 


h-n - 16 SSSISS 1 - w-h 






[*.«!•] 


£>"»'->[ io,] 

<=l 


j-i 


From this form of G we can easily determine the limit of I" A y. 

Assume the sequence of circles so chosen that their distance from the nearest 
point p ln p lu is at least d > 0.* Then on A D when |p| is large 


(57) + 

In fact if 


where |<£| is small, we infer at once that the point p lies near to p u or p lv . 

In view of (57), even/ element of G given by (56) is bounded on AH since 
the exponents have negative or zero real parts [see (29) and (49)]. Further, 
when this expression for G is used in I" A ~ B r the factor npT~ x cancels. 

Consider those terras of I* AB which do not contain the first element of the 
last column ; s is confined to this column. When we integrate these terms as 
to s, they have the form M/p (M bounded), as an integration by parts between 
the points of discontinuity of f(s) shows. But these terms all contain as factor 
an element of the first row, not the last element, beside other bounded elements ; 
since these first row terms are small for | p | large, except the p-th one when p is near 
to i?,f we conclude that the p-integration of them will yield only small terms.} 

•This refers to the minimum distance of pu, pm from any point of the circle. In the proof 
of the theorem of \ 4, pu, pm (cor responding to >■u. >iu) were found to be approximately equally 
spaced aloDg the bisectors of 8, and S,. The above construction of a sequence of circles is there¬ 
fore possible. 

t Then pic„ is a pure imaginary. 


* 



maximum of r X ^ • 


This is small everywhere on AB if t is small. Even if r is small except for a little part of AB, 
this integral is small. ‘Small’ means of course indefinitely small as |p| becomes indefinitely 
large. 
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Accordingly we need only consider the terms which contain the first element 
of the last column. This element has a coefficient 1, as it had originally in 
G(x, s; X). Here again we can restrict ourselves to terms in the brackets 
which do not contain 1/p. Finally therefore 


/V. 


AB 




where e signifies a quantity which tends to 0 as \p\ increases. 
Integrating by parts we find 


£e'' rix - t) w i f(s)ds = 


A*- Q) , * 

P P 


(. = 1 , 2 , 


where € is certainly small unless i = p and p is near to B. From this we con¬ 
clude that 


jf jf ^-"wj(s)d»dp _ -*^/(* -0) + «. 


and therefore 

(58) 


/w-£/(*-o>+«. 


To treat /"/<? we multiply the columns 1 to p — 1 of N(x, s ; \) by 
+ iy*( 5 )» an ^ t * ie c °l u,,,ns P to n by — Jy.(s) and add them to the last 
column. The elements of the /t-th column can be written in this case 


<(«>-.<-.[a,] + fft]) 

*,J + [*,]’ **■-'—’[«.] + [«,] 

and all the elements are bounded as before. The important term again comes 
from the first element of the last column, and we find precisely as before that 

(59) 7 w-Tr/i—o> + «- 

Adding (58) and (59) we find 


and by symmetry 

/;-*=*/(*-°)+«- 7 r-i/(*+°)+*. /j 4 -*/(*+«)+•. 

Hence we find 

f( x “0) +f(x +0) 

1 r — 2 -+ «• 


This proves the first part of the theorem. 

When x = b the preceding work must be modified, since now the elements 
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H + 1 to n of the first row in (56) have the form [1]. Other elements of 
the last column now become important beside the first. An integration by parts 
with respect to s followed by a p-integration gives us terms 

*,/(« + 0) + 5,/(5-0). 

The same result must of course hold at x = a. 

When n = 2/x the attack is almost the same. One obtains a determinant 
expression for G by modifying the last column and distributing the two factors 
of A into the /*>th and (p + l)-th column.* The elements are bounded on 7 , 
in this case and one obtains first which tends toward \f(x — 0). The 
remainder of the proof is as before. 

•See (38) in which I is factorable into 

([♦,)+]> ( cm+< ) >. 
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Reprinted from Trans. Amer. Math. Soc., April, 1909, Vol. 10, 
pp. 259-270. 

EXISTENCE AND OSCILLATION THEOREM FOR A CERTAIN 
BOUNDARY VALUE PROBLEM* 

BY- 


GEORGE D. BIRKHOFF 

It is the purpose of this paper to consider the existence and oscillation of the 
real solutions of a linear differential equation of the second order 


(1) 

as? +?(*• M“ = ° 

(a<x^6) 

subject to the self-adjoint boundary conditions 


(2) 

a 0 u(a) + 0 9 u(a) = y 9 u (b) + * 0 u(b) % 
<*^'( 0 ) +0 l u(a) = y l u(b) + S l u(b), 



- V, - 7.S, - s.r,, 

where the real coefficients a 0 , y 9% S 9 and a, t y % , 8, are not proportional. 

The function q(x , X) is assumed to be continuous in (x, X) for all real values 
of X when x lies in (a, b) and to increase steadily with X in such a way that 

(3) lim q(x, X) = — co, lim q(x t X) = -f oo. 

We lay aside the trivial solution u m 0. 

Sturm considered the problem under the special boundary conditions 

(4) a,u'(a) + 0,u(a) = 0, 7l u(b) + S,»(4) = O.f 

Mason $ has proved the existence of an infinite number of values of X furnish¬ 
ing a solution, when q(x , X) has the form \A (x) - g(x) % and has given also 
an oscillation theorem for the special conditions 

( 6 ) «(<*) = w (h) % u(a)=ti'(b). 

B ocher§ e mploying other methods has removed Mason’s restriction on 
* Presented to the Society (Chicago), January 1, 1908. 

f Lionville's Journal, ser. 1, vol. 1 (1836), pp. 106-186. His linear differential equation 
appears in a more general form. 

{These Transactions, vol. 7 (1906), pp. 337-360. 

§Comptes Rendus, vol. 140 (1903), p. 928. 
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q (x, A) for the conditions (5) except that a certain uniqueness of the values 
X is not proved. The method of this paper is like BoCHER’s, being based on 
certain theorems due to Sturm. 


( 6 ) 


§ 1. Condition for a solution. 

In order to have a convenient notation let us write 

At w (*)] = + £<> w (*)> ^.[“(s)] = y 0 u X x ) + s „ u ( x )* 

A[*(*)] = a i u '( x ) + *,[«(*)] ■ 7,*'(*) + $,*(*)• 

Furthermore we assume 

( 7 ) a oPx ~ P» a i = 7 0 $, - ^« 7 , = 1 , 

as we may, since we can change either a 0 , £ 0 , 7# , 5 0 , or £,, 7l , 6, by a con¬ 
stant factor.* 

We now obtain an explicit condition that l shall be a value of X for which 
there is a solution of (1), (2). Determine t/ # (x, X), t/,(x, X) as the solutions 
of ( 1 ) for which 

A>[“»( n > *)] ■* °> A[“.(°.M]-i. 

i 0 [u.(a,X)] = l. A [«,(<», X)]_0. 

These equations fix w 0 , ti', ti,, u\ at x a, and hence determine w 0 and ?/,. 
We readily find that 

( 9 ) A>Oo(*’ X )]A [>.(** M] - A, [>.(** X )]AOo(*» x )] - - 1, 

for the left-hand member reduces to i/'u, — = constant, and this constant 

must be — 1 by (8). Also we have 

(10) M,[u,(x, \)]Ar, [u,(z, X)] X)]ir, [«„(*, X)] - -1, 

since here also the left-hand member reduces to w'n, — ?/j w o . 

The necessary and sufficient condition that there exists a solution uf 0 
when X = / is that <f>(l) = 0, where 

(11) *(X) s M 9 [u t (b, X)] + If, (>.(&, X)] - 2. 

In fact the solutions w 0 and ti, of (1) are linearly independent by (8), and there¬ 
fore we may write any solution u of (1) in the form 

If this expression be substituted for u in the conditions (2), we obtain, after sim- 
• If, however, 

<V ? i “ fa *i = ~ 4»/| =0, 

this is Dot possible. Id this case the conditions are of the Sturmian type (4) and we do not 
Deed to treat this case. 
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plifying by means of ( 8 ), 


c i = [".(4. »■)] + c,M 0 [u,(b, X)], 

C . = C .-V. [“.(&. *)] + 0,(6, X)] . 


These two equations are linear in c 0 and c,; a solution u $ 0 which satisfies the 
conditions ( 1 ) and ( 2 ) will therefore exist if and only if the determinant 


(13) 

vanishes. 


♦(*)- 


l-Jf, X)]. 


1--V. [“.(*• X)] 
X)] 


By means of (10) for x = b, this identity reduces to (11). 


§ 2. Simple and double solutions. 

A value l of X for which a solution u $ 0 of (1), (2) exists is said to be simple 
if all the solutions are linearly dependent: if, however, there exist two linearly 
independent solutions of (1), (2) for X - the value l is said to be double. The 
necessary and sufficient condition that \ = lisa double value is that 

(14) ^ o [H o (i,f)]- J V 1 [u l (4,f)]_0, *,[« 1 (M)]-.*i[*(4,0]-l- 

In fact two linearly independent solutions of (1), (2) exist if and only if the ele¬ 
ments of the determinant <f>(\) given by (13) all vanish. 

The question now arises: How does < 6 (X) behave at simple and double values? 
It is the object of this section to show that «#.(X) changes sign at a simple value 
of X and preserves its sign at a double value. 

It is essential first to derive some formulas. Let us change X to X = X + SX; 
all functions /of X will then change to /-/ + «/. The function 8 u„(x, X) 
fulfills the conditions <V ' 

( 16 ) [*“.(»• *)] +?(*. h)&u t (x, \)=-Sg(x, X)S,(x, X), 

< 16 ) *“.(<«. = *■)] = «; 

the equation (15) being obtained by subtracting from one another the equations 
satisfied by u 0 and 5 # , while the equations (16) are a consequence of the fact that 
u o( a > M and i^(a, X), as determined from ( 8 ), are independent of X. A like 
set of equations holds for $u,(x, X). 

From the non-homogeneous linear differential equation (15) and the condi¬ 
tions (16), the function Su 0 (x, X) can be explicitly obtained; we have 

(17) Su,(x, \)=£ K(x, X)u,(f, X) - u,(x, X)u 0 ({, X)]S ? (f, X)« 0 (f, X)<ff, 
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as may be proved by a direct substitution. If we differentiate (17) as to * we 
obtain 

( 18 ) ~ [*«,(*. x>] =£(«;(*. x>,({, x)-«;(*, xK(f, x)]«j(f, x)u 0 (f, \)d(. 

From (17) and (18) we get 

(19) 4JX)] -£ X)]u,(f, X)-^ 0 [u,(6, X)] U „(f, X))S ? (f, X) 

x U 0 (t, X)d{, 

(20) { if, K(&. X)] U.(f, X)-M, [u,( 4, X)1 utf, X)} X) 

x u 0 (£, X)d£, 

and in like manner one can prove 

(21) X)]-£(^.[u.(4, X)]u,(f, X)-^[ U ,(4, X)]„,({, X))S 9 (f, X) 

(22) 8AT,L«,<4, X)]_ £ (Jtf,[ Uo (4, X)]u,(f, X)-^[u,(4, X)]u 0 ({, X))«g(f. X) 

x “,(f i X)df. 

We are now in a position to consider the sign of <f> near a value l of X for 
which 4> = 0. We first take the case when l is a simple value. 

By means of (11) we have 

*-**K<4, x)] + ur 1 [s(».M]- 

Therefore by (20) and (21) we obtain 

( 23 ) -r ^ a ' (f ’ x>c ' (f ■ ^“.(f ■ x w«* x > 

+ ^,[««(4, x)]*,({, XK(f.X)-^,[ u ,(4, X)]u,(f, X)u,(f, X))4 ? (f,x)i{. 

It is our purpose to investigate the sign of B<f> for small $X by the aid of this 
formula. 

The integrand on the right is the product of two factors, the second of which, 
fy({, X), kas the sign of $X, since g(( t X) by hypothesis increases with X. As 
the quantity SX becomes smaller, u g and t/, approach u 0 and u,, and the first 
factor of the integrand approaches a homogeneous quadratic form in u 0 ({, X) 
and u,(f, X) which is obtained when we replace u 0 ({, X) and «,({, X) by 
u 0 ((, X) and *,(£, X) respectively. The discriminant of this quadratic form is 

{— -V 0 [u,(4, X)]+ [u,(4, X)])’+ 4i/' 0 [u,(4, X)]^f,[u,(4, X)]. 


If here we substitute the value 


M 0 [u 0 (b y X)] X)] - - 1 + Ar 0 [u,(6, X)]if,[u 0 (6, X)] 
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obtained from (10), the discriminant becomes 

{3f 0 [u 1 (6,X)]+^ 1 [t/ 0 (6,X)])*-4, 

which vanishes at X = J by (11) since <f>(l) = 0. 

Now it is clear that the three coefficients of the form are not zero except 
at a double value; otherwise with the aid of (10) we derive equations (14). 
Also the functions w # and u, are linearly independent. Thus as X tends to a 
value /, the limit of the first factor of the integrand is an expression of 
definite sign for a = f = 6, the same as that of M 0 [u 0 (b, /)] and J/,[w,(6, /)], 
save at isolated points at which it vanishes. 

We conclude that at a simple value l of X, $( X) changes sign in such a 
way that 8<p/S\ is of the same sign as Af 0 [w # (6, X)] and M x [u,(6, X)].* 
Let us now pass to the case of a double value l. If we recall conditions (14), 
we find that for X =» / 

*,[•<,(». *)] = [“.(*• *)]» *.[«,(», *)] = X)] , 

*.[«.<»• M] = ! + *^.[“.(4. X)], M, [u,(4, X)] = 1 + [*,(&, X)]. 

Substitute these values in (10), taking X = X, and we find 
SM„[u,(b, X)] [u,(b, X)] - {«J([*,(», X)] + SAf, [u 0 (b, X)] 

+ bif t [U|(6, X)] [u 0 (6, X)] } •= 0 

so that 

*0 -**>[«, (*, X)]**, [*,(&, X)] - 5A/ 0 [ U| (6, X)] [u 0 (b, X)]. 

•ThiH argument is satisfactory when the ^-derivative of q( x, A) exists on either side of / and 
is positive for some x of ( a, b ). 

Assume now that the above statement is not true for p on one side of /, say for A > /. Replace 
q in the interval (l , l + d ) by 

«•(*, *)=*(*, O +^jrf(*. l + d), 

which satisfies the above condition. Then we have 

*•(!)=*(!) =0, f*(|+d) = *(/ + *). 

But the above statement does hold for f. From this it follows that for a proper choice of d, as 
small as we please, we shall have *•(/' )=0 where K/'^f +<f. But for two successive simple 
values for which *• (A) = 0, il 9 [ uj ( b, X) ] and M x [ u\ ( b, A ) ] must change sign (or vanish) 
81 no® the sign of Sf m /iX does, and at a doable value both vanish. We conclude that 

M] = 0. Jf,[«!(*, A,)]=0, (!<>.,, >.,</+<*). 

Let d now tend to zero. Then we infer 

A/ 0 [u # (6,/)] = if I [« 1 (6,,)] =0 . 

Since ?(f) =0 these relations, combined with (10) when z = f, show that / would then be a 
double value, oontrary to hypothesis. 
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If now the expressions (19), (20), (21), (22) be simplified by the use of (14) and 
substituted in this expression for &£, and if we write for fi 0 (£, A), tz,(£, A) 
their limiting values w 0 (£, X), m i(£i we obtain 

- JT‘«!({, x)«»( f, \)df jf «f(f. x)i{ 

But by a familiar inequality 

and ^ are linearly independent real continuous functions. Hence, 

if we write 

/_«,(f.X)»'±«j({.X), S r = u,(f,X) l /±S ? (f I X), 

using -f or — according as B\ is positive or negative, it is apparent that the 
quantity (24) is negative. Therefore <f>( X) preserves a negative sign at a double 
value of X.* 


provided f 



§ 3. The existence theorem. 

Now let the infinite set of values of X furnishing a solution u of (1) for the 
Sturmian conditions 

(26) Z,[«.(<»))-0, Jf„[ U (6)]=0, 

be denoted by 

x,,x„... (*,<*,<•••). 

These values separate the X-axis into the intervals 
(26) 

This division into intervals is not uniquely determined, for the conditions (2) 
can be replaced by any two linearly independent conditions which arise from 
(2) by linear combination. 

The existence theorem. There exists an infinite set of values l t , ••• 
of X furnishing solutions of (1), (2). If we take these quantities in order of 
increasing magnitude counting each double value twice, there are the following 

• This argument is not satisfactory uoleas the ^-derivative of q (*, X ) exists on either side of I, 
and is positive tor a ^ x ^ b . 

If the derivative does not exist, we proceed as in the previous footnote and prove that 
M'[u 0 (b,X)) Mtiulib, X)] wish in ih* vicinity of l as well aa at /. This is impossible 
since the seroe of these fanotions are separated by finite intervals aooording to the results of STORM. 
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possible cases : 

I, h<\< l t = \ = h<\<K = \ = l i <\-, 

II, \ = l x<\<h = \ = h<\<K = V", 

Proof. Clearly u 0 is the only solution of (1) (save for a constant factor) 
which satisfies the first condition (25), and if the second of these conditions is 
also to be fulfilled, we must have i # (5, X)] = 0. Hence we see that 

(27) JtOb(*,*)]-0 for X = X,, X t , •.., 
but for no other X. It follows from (10) that 

( 28 ) \)]^1 [M*« \)] " 1 (i = l, 2, •••). 

Substituting the value for M 0 [u t (b t X.)] obtained from this last equation in 
(11), we find 

< 29 > 

We infer that $ has Me same sign as if, [«,(6, X)] at the values X- X,, 
unless if, [u 0 (6, X)] = 1, when <f> vanishes. 

But since u,(o, X), u,(a, X) are independent of X, it is a consequence of 
familiar theorems due to Sturm that the roots of i/ 0 [u,(h, X)] = 0 and 
M,[u 0 (b, X)]=0 separate each other, and that M,[u,(b, X)] and 
^ X)] change sign when they vanish.* 

Accordingly by (27) it is clear that if, [«,(6, X)] alternate, in sign at the 
values X,, X,■ • ■. Either if, [u„(6, X)] is positive at X,, X,, ... and negative 
at X,, X 4 , • • •, or vice versa. Thus two cases arise: 


Case I l****\-h-- 
l*<0atX,,X„..., 


when if, [u,(6, X,)] >0. 


Case II 

[ $ £ 0 at X,, X ( , - 


when^[ Uo ( 6 ,x,)]< 0 . 


We see that there must exist values l of X as follows. In Case I there exist 
at_le»st^two values l, say l„ l„ in each double interval (X*, X^ t ), 
*Loa dt, p. 139, 142. 
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p = 1,2, • • such that 

* *i<\*+ j» 

and at least one l t such that 

\ = < V 

In Case II there exist at least two values l t in each double interval 

(V-i • K+t )• P - 1 • *- ■• • 8nch that 

= ^ = 

For this deduction it is necessary to know (see § 2) that <f> preserves its sign 
only at a double value of l which we count as two. 

It remains to prove first that there exist only the two values l t in the 
double intervals; secondly that there exist in Case I either two values /,, l % in 
( — oo , X,) such that 

/, £ X, ^ /, < X, (Subcase I 6 ), 

or one value /, such that 

X, - /, < X, (Subcase IJ; 

and lastly that there exists in Case II only one value /, in ( — oo, X,) such that 

l t - (Subcase II,), 

or none (Subcase llj. 

Let us take up the first point If there exist additional values l in some 
double interval (X ? , X f>| ), there will be at least four, since and 

have the same sign. If there is no double value, there must at least two simple 
values fall within one of the intervals (X f , X^,) or (A^,, X f+| ), at which B<f>/B\ 
is of opposite signs. But this is impossible since by the last section ty/$X has 
at all such values within one interval the sign of JW 0 [u 0 (6, X)], which is invar¬ 
iable within the interval. On the other hand if a double value exists, it must 
fall at by (14); then <f> in the neighborhood of X f+1 is negative by §2. 
Other values l besides the double value would then imply at least two roots 
l existing in (X f , X J+1 ) or (X f+I , X f+1 ), inasmuch as <f>(\) and <f> (X f ) are nega¬ 
tive. This is impossible, as we have just seen. 

Thus the first point is proved, and the second and third may be treated in the 
same manner. 


§ 4. Discrimination between the four sub-cases. 

We have Case I or Case II according as M x [u 0 (6, X,)] >0 or M x [u 0 (6, X,)] <0, 
and under each of these subcase a or 6 according as <f> is of the same sign as 
M x [u 0 (&, \)] or opposite sign, for large negative X. This is evident from (80). 
It remains now to determine the sign of these quantities. 
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&HF* °fM x [«,(&, X,)]. 

We have X)] =0 for X = X, but not for X < X,. By theorems 

of Sturm • u 0 (x t X) does not vanish for o<x<i and X^X,. There are 
four sub-cases : 

( 1 ) *o* 0 , 7 o * 0 . 

Since y 0 u' 0 (b, \) + $ 0 u 0 (&, X,) = 0, we have by (7) 

But u„(4, \) has the same sign as u,(a, \) = a,. Hence the sign of 

M, [“.(*. \)] “ Aat °/ a ./v 

( 2 ) o t * 0 , 7,-0. 

Here we have «,(4, X,)-0, and 4(4, \) wiU have the opposite sign to 
= V Therefore the sign of if, 0,(6, X,)] U that of - a^,. 

(8) ®o = 0, 7, + 0. 

Here, as in case (1), 

jr,[ s( 6,x,)]-M^L}. 

'0 

But u,(b, \,) has the same sign as 4(o, X,)_ _ /9 0 since U ,(a, V)- a„_ 0. 
The sign of M, [»,(4, X,) j is that of - 0Jy o , 

(4) a. = 0, 7 o -0. 

Here, as in case (2), 

7,4(6. m. 

where 4(4 ,\) has the sign opposite to 4(o, X,) = since u 0 (a, X,) = 0 
and u„(4, X.) = 0. The eign of -/V,[u,(4, X,)] is that o/-/9„4 0 . 

Sign of <b(\)for large negative X. 

It remains to determine the sign of <4(X) for large negative Let us 
assume first that a 0 = 4 = 0 . 

We have, in view of the relation (10) between the solutions of (1), the 
equation 

*.(*, *)-(«+»£ x)]1 ) «.(*, x). 

By means of ( 8 ) we determine the values of the constants m and n and 
•Loo. dt, p. 140. 

Trrnm. Am. Math. 8oa IS 
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have * rJ 

«,(*.>■)=(-J+X (^w) u ' {x ' x) ' 

If we substitute this expression for u,(x, X) in (11), we find 

H x ) = yi u o( b > x ) + x ) 

(31) + *l(”S + I [s(?x)p)“- (&,x) + «.(i,^] 

Now let us recall that for x > a 

lira u' (x, X) «= lira u # (x, X) = oo, 

A—• A—• 

v,m + *>■ 
a— u # (x, X) 

It is then apparent that <£(X) has the sign of (<* 0 7i — 7 ®«iK(&» X )l a o when 
X is large, if a 0 y x - y 0 a x + 0. But *;(*, X) has the sign of u 0 (a, X) = a 
Hence for large negative X the sign of *(X) *« that o/(a 0 y x - y 9 a x ) % provided 

that a 0 7, - 7® a , + °- T lt , . v 

jf ao = o, the above statement remains true. In this case we express u 0 (x, 

in terms of ti,(x, X) since a, 4= 0. 

Let us next attend to the case where a 0 7, - 7« a , = 0 but a o + °* ln tblB 
case we transform the second of conditions (2) so that - 7, - <> by linear com- 
bination. Furthermore we choose a multiplier so that (7) also holds, whence 
0 X = l/a 0 , 5, = l/7 # . 

When these values of <*„£,, 7,. are substituted in (31) we get 

*(*)■= 7.[u.(x, xjT ,u,(6,x)+ V&Tm) 


+s «rtv^)?“» (6 ’ x) - 2 - 


Now u t (b, X), U,(b, X) have the same sign as v„(a, X) = Therefore by 
(32), for large negative X, <f,(X) ha, the sign of a„ 7o in the case 0,7, -7,<*, = «- 
o 0 4= 0, if the second condition of ( 2) is so cAosen tAaf 

Ol =0, 0, = ^, 7, = 0, 

Finally let us consider the case where a 0 y x - y .a, = 0 and a 0 = 0. In this 
case a, is not zero since a o 0 x - ft a, « 1. By symmetry it appears that/or foye 
ncyatitw X, *(X) has the sign of - 0,7, */ 6otA a #7l - 7 ®<V= 0 and a, - 0. 
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It should be noted that the above classification depends on the selection of the 
first condition £„[*(«)] = [«(&)] out of the double infinity of conditions. 

§ 5. The oscillation theorem. 

Theorem. The solution u p (x) of (1), (2) which corresponds to X = l p 
vanishes p — 1 , p* or p + 1 times for a <x = b in accordance with the follow¬ 
ing table , in which 

5„ - 7.«, - <vr,> K - tA - 0,y„ <"«<* K=~ $,K-‘ 


I. 

a o >0, p = 2m. 

u ' (a) 2 K- 
“,(«) " ‘ 

+ 1 times, 



«,(«)>’ 

times, 


p = 2m 4 1, 

silDajr. 
«,(“) ’ 

p times, 



s(«) > ’ 

p — 1 times, 

II. 

«o<°» />- 2 m, 

«,(«) 

^ times, 



“.(«)' * 

/> — 1 times, 


/> = 2 m + 1 , 

«,(«) 

p + 1 times, 



^)^ K . 

p times, 

III. 

5 o = 0 » K > 0 » p = 2m; 


p times, 


p - 2m -f 1; 


p + 1 times, 


a 0= < 0 » p — 2m; 


p -f 1 times, 


p = 2 m + 1 ; 


p times. 


Proof The conditions may always be written in the form f 

(33) + 

a,w'(a) + & t u(a) = — u(b) + 5, ?/ (6). 

•At a doable value we can take any solution *,(x) as correspond i dr to Also if u,(n)=0, 
we make the convention that u'(a)/u,(a) = — x. 

flf the conditions be taken in this form at the start, no transformation is necessary. To obtain 
the first of these new conditions from ( 2 ), multiply the first by — ; 0 , the second by >,, and add. 
The second of these conditions is then any condition so taken as to satisfy (7). 
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Then 4>(\) has the sign of - 5 0 7, = 5 0 at X, and of (a 0 y t - y Q a l ) = a Q 
for large negative X. This follows from the last section. With these new con- 
ditions we therefore have case I B or II a of the existence theorem. Accordingly 
by the existence theorem, the value l p lies on the interval (X p . X p+| ). 

By well-known theorems of Sturm, w 0 (x, X) will vanish p times on a <x <6 
for X SX^X^,. But the roots of u p (x) and u 0 (x, l p ) separate each other, 
or else coincide. .Hence u p (x) vanishes p — \ % p, or p+ \ times on a <* = &. 

Consider first the case 5 # > 0, and assume that u p (a) and u p (b) do not vanish, 
a retriction which is easily removed. If we have p = 2m and u p (a)/u p (a) ^ K , 
we proceed as follows: Since, by definition the equation v 0 ( a, l p )/u 0 (a t l p )=K 
holds, the function u p (x) vanishes at least as often as u 0 (x , l p ) and hence p or 
p + 1 times. But also from (33) we have a 0 u p (a)[u' o (u)lu r (a)-K]=u p (b). 
Thus u (a) and u p (b) have opposite signs, and u p (x) vanishes an odd number 
of times. This excludes the first possibility. Hence u p (x) vanishes p+ 1 times. 

Likewise if p = 2m and u p (a)lu p (a)> JT, we see that «,(*) cannot vanish 
more often than u 0 (x, l p ), and therefore vanishes p - 1 or p times. However, 
w (a) and u (6) have like signs, so that u p (x) must vanish p times. 

9 Also if p = 2m + 1, u p (a)lu p (a) £ K< we find that «,(*) must vanish p 
times. 

Also if />-2m+ 1, u p (a)/u p (a)> K, we find that u p (x) must vanish 
p — 1 times. 

A precisely similar discussion is possible when a # < 0. 

When one has ^ = 0, /8 0 > 0, the first condition (33) shows that t*,(a) and 
u (6) have like signs. Also u p (x) has at most p or p- 1 zeros, since 
u[( X , l ) vanishes for x = a. From this follows the table for this case. 

In a like manner one may discuss the case 5 0 = 0, < 0. 

University or Wisconsin. 

OH. 28, 1908. 
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ON THE SOLUTIONS OF ORDINARY LINEAR HOMOGENEOUS 
DIFFERENTIAL EQUATIONS OF THE THIRD ORDER 

By Georoe D. Bikkhoff 

Introduction- Given a linear differential equation of the third order 

(1) y" + P'f + + r, J = °' b ), 

in which the coefficients/), q , r are real functions of z, continuous together 
with their derivatives of all orders, but otherwise unrestricted ; what is the 
general character of the solutions and how do they depend on these coeffi¬ 
cients? This rather interesting question lies almost wholly untreated, and it is 
the aim of the present paper to consider it. 

The analogous question in the case of an equation of the first order 

y +P!/ = 0 

is at once answered. By means of the explicit form of the general solution 
y = c e - '(c, a constant), 

we see first that y cannot vanish, unless it does so identically. Also any func¬ 
tion yi which does not vanish is the solution otsuch an equation,.namely 

'-S'-°- 

Thus the general characterization of the solutions has been obtained. The 
exact nature of the dependence of the solutions upon the coefficient/) is exhib¬ 
ited by the explicit formula. 

If the equation be of the second order, 

y" + p</ + w = o, 

the general solution has the form 

c i!/i + °i!/^ (c„ c*, constants), 

where y, and y a are any pair of linearly independent particular solutions. It 

(103) 
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is an immediate consequence of the fundamental existence theorem for ordinary 
differential equations that 

w = yiy* - y*yl 

cannot vanish. For suppose that this expression did vanish for x = £. In 
this event we could determine a and £, not both zero, so that the two equa¬ 
tions 

ayi(f) + f>Mi) = 0. °yi(f) + 0y»(f) = 0 

would hold. The solution 

y = ayi + 0!/t 

and its first derivative would then vanish at * = £. But, by the existence 
theorem referred to, there is but one such solution, namely y = 0. Accord¬ 
ingly y, and y % would be linearly dependent, contrary to our hypothesis. 
Thus IF cannot vanish. Furthermore if y, and y a are any functions for which 
IF * 0, these functions will form a pair of linearly independent solutions of 
the linear differential equation of the second order 


y" y y 
y" sri yi 
y* yi y* 


the coefficient of y" being precisely —IF. This coefficient can be made unity 
by division of both members of the equation by —IF. 

The analytic condition IF * 0, which has been shown to characterize yi 
and y 3 . tells us that the ratio of y, to y t increases or decreases throughout the 
interval considered. Let now y, and y* be regarded as the coordinates 
of a point in the projective line. As x increases through the interval, the 
point will describe the line in one sense continually. The elementary Sepa¬ 
ration Theorem concerning the roots of y t = 0 and y, = 0 is an evident con¬ 
sequence of this fact: the roots of y, = 0 and y 2 = 0 will separate each other. 
The above results give a characterization of the solutions, and thus answer 
the first part of our question for a linear differential equation of the second 
order. 

As the most important of the theorems which deal with the dependence 
of solutions upon the coefficients we cite the following Comparison Theorem : 
suppose two equations 

y" + y,y = 0 and y" + = 0, 
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to be given ; then there will be at least one zero of any solution of the first 
equation between two zeros of any solution of the second, or less precisely, 
an increase of q in the equation 

y" + qy = 0 


brings the zeros of the solutions closer together .• 

Very analogous theorems exist in the case of the equation of the third 
order, and it is with these that we are concerned here. 

1 . Geometrical Interpretation. If one writes 


W = 


y\ Vx Vs 
y\ y'x y's 
yi' y'i y's 


a necessary and sufficient condition that y*, y it »/ s form three linearly inde¬ 
pendent solutions of some linear differential equation of the third order ( 1 ) is 
seen to be W * 0 , just as the analogous fact was seen in the case of the second 
order equation. An explicit formula for W is obtained by noting that one has 


dW 

dx 


y\ y* y* 
yi y* yi 
yi" yi" yi" 


-pW, 


by ( 1 ), so that iris given by the formula 

(2) W=ce-f* t (c £ 0). 


•Take for example the equations y" + X,y a 0 and y" + X r y = 0, where X| and X, are 
positive constants, X| greater than X,. The Interval between zeros Is fixed In both cases, 
»/^X| in the first and »/^X* In the second case. 

The theorem Is more general than at first might appear since any equation of the second 
order may be given the normal form, 

y" + 9 y =0. 

by a simple transformation. 

The reader Is referred for the proof of this and similar results to articles by Professor 
Bdcher In the Bulletin of the American Mathematical Society, vol. 4, pp. 295-313, pp. 3C5-37C, 
1897-1898, as well as to the original paper of Stunn, Journal Je Mathematique. vol. 1, pp. 106- 
186, 1836. 
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To a set of linearly independent solutions y x , y 2 , y 8 of a given equation 
(1), whose general solution is then 

W\ + c*y* + «*y*» 

there will correspond an integral curve C defined by the equations 

M> x = y x (*), w t = y,(*), ^3 = 3/s (^). 

in which toj are the homogeneous coordinates of a point m the pro¬ 

jective plane referred to a certain triangle of reference. It is clear that any 
other integral curve D obtained by making a different choice of the three lin¬ 
early independent solutions will be projectively equivalent to G\ since the new 
solutions can be expressed linearly in terms of y x , y t . .Vs- 
The equation of a tangent line to C is 


to, w t 
'J\ V* 


1Tj 

y* 


= o. 


(3) 


I yl y* yi I 

The homogeneous coordinates of this line are therefore 

X, = X, = X, = 


the divisor W being so chosen that x„ x„ x, are solutions of the adjoint 
equation to (1) : 

(4) 'J" - (py)" + (?y)' - r, J = o- 

The fact that x„ x„ x,are solutions of (4) may be verified by substitution. 
The relation between (1) and (4) is entirely reciprocal in its nature, for 
equation (1) is also the adjoint to (4). Furthermore, the functions x„ x„ x, 
are linearly independent solutions of (4) since otherwise the curve C would 
reduce to a point or straight line, and y„ y„ y, would be also linearly de¬ 
pendent. The reader is referred to Darboux, Thtorie des Surfaces, vol. 11, 
book IV, chap. 5 for a discussion of the properties of tho adjoint equation. 
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The identities 


(*) 


yi*i + y& + = o, 

y\ z \ + y& + y'i z s = °. 
yi' z i + y?*t + y?*s = h 


are obvious on substitution of the expressions given in (3) for z lt 25 , 23 . 
These equations serve also to define z„ z*. Zj in terms of y„ y*, y s . 

Returning now to the curve (7, we see that it is a continuous curve in the 
projective plane (i.e. if the projective plane be mapped on the ordinary 
Euclidean plane so that P 0 is a point of C in the finite plane, then C is con¬ 
tinuous at P 0 in the ordinary sense) since y lt y Jf y 3 are continuous functions, 
not all simultaneously zero ( W * 0). Likewise C has a continuous! y turn¬ 
ing tangent in the projective plane (i.e. if the projective plane be mapped on 
the ordinary Euclidean plane so that P 0 is a point of C in the finite plane, 
then the direction of the tangent line varies continuously along the curve at 
P u ), since z lt z 4 , z ^ are also continuous functions, not all simultaneously zero. 
Thus C is a regular curve in the projective plane. The fact that y„ y 4 , y 3 
have derivatives in z of all orders may be taken account of by saying that C 
is completely regular. 

The significance of the condition W* 0 is that there are no points of in¬ 
flection. In fact, by definition, the tangent at such a point has contact with 
the curve of higher order than the first, and hence if 


a,ic, + a 3 w t + a a w 3 = 0 
is the tangent line, one has simultaneously 

«iyi + <**y* + <*$ys = o, 

«iyi + a *yJ + a *yl = o, 

«.y7 + a «y'i + y? = o. 

This is possible if and only if W vanishes. The curve C may therefore be any 
completely regular curve without any point of inflection. The reader will re¬ 
member that the only condition on y x% y 4l y 3 that they form linearly indepen¬ 
dent solutions of some equation ( 1 ) is the condition W * 0 . 

If we consider the projective plane as lying in the ordinary Euclidean 
plane, die curve C possesses a tangent which rotates continuously in one 
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direction until the curve passes through the 'line at infinity’ which is the im¬ 
age of any arbitrary line in the projective plane. It then returns with a tan¬ 
gent rotating in the reverse direction (consider for example a conic, which 
has no point of inflection). 

A transformation 

y = Xy, (X ^ 0), 

followed by a division by X, leaves (1) linear and of course does not change 
the curve C. By choosing 

X = e-»"-\ 

the coefficient of the second term of (1) will reduce to zero and that equation 
will take on the simple form 

(6) !/" + VJ * + ry = 0 , 

to which the adjoint equation is 

(7) \T + vJ + (?' - r )'J = °* 

It may be observed in passing that this last equation might also have 
been obtained from (4) by a like transformation in which however 

X = . 

A simplification introduced by giving our equations this normal form is that 
W reduces by (2) to a non-vanishing constant. 

From this point forth wo shall often employ our equations in this simplified 
form. The curve C may then be looked upon as given either in normalized 
point coordinates by (6) or in normalized line coordinates by (7). 

The above interpretation of the solutions of an ordinary linear homogen¬ 
eous differential equation by means of a curve is a well-known one. The sec¬ 
ond and third chapters of Professor Wilczynski’s Projective Differential Ge¬ 
ometry contain a full development of it. 

2. General Separation Theorem. The above geometrical repre¬ 
sentation furnishes us at once with the negative conclusion—not to be seen 
very readily without use of this representation perhaps—that it is possible to 
choose (a) the differential equation (1), and (6) three linearly independent 
solutions y lf y„ y s , of the same, so that the zeros of y lf y 9 , y s will succeed each 
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other in any arbitrarily prescribed order. It will suffice to show that there 
exists a completely regular curve C without points of inflection which cuts the 
sides of the triangle of reference in any arbitrarily prescribed succession, since 
such a curve is the integral curve of some equation (1). The existence of such 
a curve may be readily seen as follows: Let C be drawn up to a point P not on 
a side of the triangle of reference; the curve C may be extended beyond P 
without the introduction of points of inflection, so as to cut any assigned side 
of the triangle. The method of extension is given in the adjoining figure 



(Fig. 1). By a succession of such extensions, C may be made to cutu>, = 0, 
to i = 0, w 3 = 0 in the desired order. 

It is only when wo consider the solutions y„ y t , y 3 and z„ z t% z 3 together 
that we obtain a Separation Theorem concerning the order of the zeros of 
these six functions. 

General Separation Theorem. Between any two successive zeros of 
Hi (or of z t ) there are an odd number of zeros of y L and z , together , where 
(*, k, l) is any permutation of ( 1, 2, 3). 

Proof. If we can prove that between any pair of successive zeros of y, 
there are an odd number of zeros of y*and z 3 together, the theorem will follow 
by considerations of symmetry and duality. We may suppose without loss of 
generality that the line w, = 0 of the projective plane goes over into the line 
at infinity in the Euclidean plane. The arc of C corresponding to values of 
* between the two values for which y, = 0 is then a single branch of the curve. 
The line w t = 0 is some line in the finite plane. The geometrical interpreta¬ 
tion of the condition y 3 = 0 is then that the curve Ccuts this fixed line i 4 ’. = 0 ; 
the interpretation of the condition*, = 0 is that the tangent line is parallel to 
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the line w t = 0 , in other words, passes through the intersection of to, = 0 and 





Nov? (Fig. 2) if the arc enters on one side of w t = 0 in this plane and 
leaves on the other there are an odd number of zeros of y 4 and an even num¬ 
ber of zeros of z 3 ; otherwise there are an even number of zeros of y 4 , and an 
odd number of zeros of z 3 . In either case the theorem holds. 

Self-Adjoint Equations. If the integral curve C is self-dual, equations 
( 6 ) and (7) are identical since the normal form ( 6 ) is unique. The con¬ 
dition for this is plainly , 


2 


In this case, which possesses especial interest, z„ z 7% z 3 must be linearly 
expressible in terms of y„ y 4 , y,: 

*i = c»iyi + c^y, + c I# y„ 

= Ciiyi + + Cttf/s. 

*s = cj,yi + c»yt + cssys- 

Since the solutions z„ z„ z 3 are linearly independent, the determinant of 
the coefficients is not zero. Hence one finds from the first equation (5), by 
substitution of the expressions above for z 4 , z 3% that 

Ciiyl -1- Ciiyl + +• (q, + Cu)!/^ + (Cn + + (c M + c 4 i)yiy 2 = o, 

the equation of a conic unless 

C n = C„ = Cjj = 0, C*3 = —Cm, Cj, = —C,j, Cj, = —C*,. 

These conditions cannot all hold since, if they do, the determinant of the 
coefficients will be zero. The integral curve C must be an arc of the conic , or 
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the entire conic. In the latter case as x increases from a to b, the correspond¬ 
ing point P may traverse the conic in one sense any number of times, making 
q complete circuits and a partial circuit. 

The above considerations lead us at once to the following theorem : 

Separation Theorem. Self-Adjoint Case. There exist solutions of 
any equation (1) reducible to self-adjoint form which nowhere vanish in (a,b). 
If y x and y t are solutions of the equation with at least one zero in (a, b ), the 
zeros of ji and y t either separate each other singly or in pairs. 

Proof. In fact, there exist straight lines which do not cut the conic, and 
therefore solutions which do not vanish, since the condition 


a iyi + a i!/t + <*sys * 0 

is equivalent to the condition that the conic does not cut the line 

«!*«■, + opCf + a s w 3 = 0. 

Also, of two straight lines which do cut the conic, say in M, and jV,, 
y it respectively, either M x and M t separate AT, and on the conic, in which 
case the zeros of the corresponding solutions occur alternately as wc traverse 
the conic; or these pairs do not separate each other, in which case wc have 
first two zeros of the one solution, then two of the other, and so on. 

It C is merely an arc of the conic no solution vanishes, more than twice 
in ( a,b) although the theorem still holds. 

Transformation of the Independent Variable . In obtaining the normal 
form (6) only a multiplicative transformation of the dependent variable, 

.y = Xy, (X*0), 

was made use of, while it is known that(l) also remains linear when any 
transformation of the independent variable 

*=*(*). (f*0), 

is made. From the standpoint of the geometrical interpretation, the second 
transformation changes the parameter of the curve. 

What is the simplest form of the equation obtainable by a suitable com- 
bination of both transformations? 

First, if the equation be reducible to self-adjoint form the locus of the 
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point (y„ y 4 , y 3 ) is a conic, and a suitable transformation of the variables 
will reduce //„ y 4 , 1/3 to the form 

y, = cos x, y* — sin x, y a = 1 , 

no matter how many times the integral conic be traversed. Hence a normal 
form in this case is 

y"' + 7 = 0, (a x £ b). 

Likewise, since the equation ( 6 ) may be written 

+ q<J + (^ + = 0 , 

one sees that the transformation which reduces the self-adjoint equation 

(8) aT + vf + = 0 

to the preceding normal form will at the same time reduce (G) to an equation 

(9) ~J« + 7f + H1 /= 0, (a'sisi). 

The Forsyth-Laguerre normal form is 

/'•+J?y = 0; 

in this case the self-adjoint equation ( 8 ) has been transformed to the equation 

y"'=0 

with solutions 1, x, x*. Since the equations 

ic,= l, tr* = x, tc, = x* 

give us each point of the conic once and only once, this transformation is not 
proper unless the interval (a, b) is so small that the integral curve of ( 8 ) is 
an arc of the conic. 

3. Separation Theorem for Regular Intervals. The somewhat 
indefinite character of the first Separation Theorem is due in large measure to 
the fact that the integral curve C may have a very complicated form ; in par- 
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ticular it may have double points and double tangents. A regular interval 
(a, b) is deliued to be an interval such that the corresponding integral curve C 
is either an oval, or has no double points or double tangents except where C 
touches but docs not intersect itself. On account of the fact that double 
points and double tangents are dual notions the interval (a, b) will be regular 
for the adjoint equation (4) if it is regular for (1) itself, and conversely. It 
is obvious that any interval can be separated into a finite number of regular 
intervals. 

Separation Theorem for Regular Intervals. If (a, b) is a regular 
interval there will exist a solution y x of ( 1 ) which is of one sign in (a, b) and 
a family of solutions c, y 4 + such that the zeros of any two members of the 
family separate each other. 

Proof. *1 lie theorem is clearly true if the integral curve is an oval, so 
that this possibility is at once disposed of. 

Assume first that the arc AB corresponding to the interval ( a,b) does 
not cut some line of the projective plane, which is taken to bo the line at in¬ 
finity in the Euclidean plane. On account of the hypothesis that (a, b) is 
regular, it is clear that AB will be spiraliform (Fig. 3). We are not 
excluding the possibility that A or B or both lie on the line at infinity. 



Flo. 3. 


We shall now show that there exists such a line in every case. Construct 
the tangent l at a point P of AB. As P moves from A to B along the arc 
A B, the line / will not at tiret cut A P. There must, however, be a least value 
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of X, say |, such that for slightly greater values the corresponding tangent 
l will cut APi otherwise the tangent line at B does not cut AB and we are 
brought back to the previous case. Furthermore, the tangent corresponding 
to x = £ obviously passes through A. If this particular tangent line be chosen 
as the line at infinity, it is clear that AP will correspond to a single branch 
o(AB. without inflection point or double point, of course, and that accordingly 
there exist lines Z, which do not touch AP. Now take such a line Z, as the line 
at infinity, and the arc AP will have the form indicated in Figure 4. 



From this figure it is clear that, since there are no double points of inter¬ 
section, the entire arc AB must lie between the arc AP and the tangent atP. 
Hence this case is reduced to that first considered. 

Thus the integral curve C is spiral i/orm in all cases. It should bo 
noted that an oval is to be looked upon iu> a limiting form of a spiral. 

Any line which does not have a point in common with AB will correspond 
to a solution y, f which does not vanish for a £ z £ 6. Likewise all the lines 
through a point in the interior of the integral curve (see shaded region in 
Fig. 4) will correspond to a linear fhmily 

C *!Z*+ ° si/t 

whose zeros separate each other. 

4. An Auxiliary Formula. We shall now derive an auxiliary 
formula upon which for the most part the further theorems will be based. 

Let y,, y t , y 3 be three linearly independent solutions of a given linear 
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differential equation (l),and z lt z it % the corresponding solutions of the adjoint 
equation (4), and let us write 

(10) = y, (z)*i(f) + y«(*)*t(f) +y,(*)z,(f). 

The relations (5) are then 

(11) *((.()= Mi. 0=0, M0i) = U 
where our notation is 

M x . i) = . Mz, {) = ~, etc. 

Take now (1) in the special form (6) and consider the modified equation 

(12) y'" + ?y + (r + R)y = 0, 

which may be written in the form of a noo-homogencous linear differential 
equation 

(18) y'" + qif 4- ry =/ = _ /ty. 

The general solution of this equation may be written 

(14) y = c iyi + c,y« + ejy, +J<Hz, i)Ai)d(. 

In fact by direct differentiation and reduction by means of (11) one finds 

y' = cyi +vj',+ wl + j'uz, 0Ai)d(, 


y" = Ciyi' + <v/i + «sy> + J'ux, OAtyit, 
y"'= <wl"+ <hyi" + <v/!'+A x ) + J’uz, 0A0<t(- 

Substitute these expressions for y and its derivatives in (13). This equa¬ 
tion will then reduce to an identity by virtue of the fact that y„ y s , ,,, «„d 

! r ° solutlons of ( 6 )> which P ro ves that the expression for y given by 
(14) is the general solution of (13). 3 
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The function 

V = ctj/i + %y t + Cjy, 

is a solution of ( 6 ), and from the above equations we see that it satisfies the 
conditions 

(15) y(a) = »?(«), y\a) = V(a)» *S\ a ) = */'(«)• 

Thus we may rewrite (14) in the final form 

y = v-J h*. f) *(?) y(f) <*f. 


(16) 


in which’;/ and 17 arc corresponding solutions of ( 12 ) and ( 6 ) respectively, 
related by equations (15). This is the desired formula, and it represents, 
of course, nothing more than the solution of a special non-homogeneous linear 
differential equation. 

5. Test for Regular Intervals. A regular interval (n, b ) is said to 
be of the first kind if the tangent at A does not meet the nvcAB at any second 
point (see Fig. 4), and of the second kind if the tangent at B docs not meet 
the arc AD at any second point. According to this definition a regular in¬ 
terval may be of both kinds. We shall take the interval (a, b) to be of the 
first kind, stating, however, the results for both cases. 

It follows at once from the definition, as has been noted, that if (a, b) is 
a regular interval for equation (1), it is also a regular interval for equation (4), 
the three solutions z lt z it % of (4), given by (3), now being interpreted as the 
coordinates of a point. Let the corresponding point locus be the curve D. 
Then D is obtained from C by a polar reciprocation and a suitable projection. 
Furthermore, if («, b) is regular and of the/rsf kind for (1), no lino through 
B is tangent to the curve C at any other point than B (sec Fig. 4). There¬ 
fore, in the curve D, no point on the tangent through B lies on the curve Z>, 
i. e. D is of the second kind. 

A necessary condition that (a, b) is a regular interval is that all the points 
of intersection of the tangent l at any point P of AB lie on one and the 
same side of P. Thus in the preceding figure (Fig. 4) all the points of inter¬ 
section of / lie in AP*x\& none in PB. To prove the condition sufficient we 
have to show that, if it is satisfied, the integral curved// can have no double 
points or double tangents except where the curve C touches but docs not inter- 
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sect itself. Suppose there were such a double point M = N where N lies in 
MB. Then the tangent at M cute the curve at N, and the tangent at JV cuts 
the curve at .1/. This is impossible if the condition is satisfied. If there 
were such a double tangent at M and JV, where N lies in MB, the tangent 
line in the vicinity of M would cut the integral curve in the vicinity of W, 
and likewise the tangent line in the vicinity of N would cut the integral 
curve in the vicinity of M. This is also impossible if the condition is sat¬ 
isfied. Thus the validity of the condition has been demonstrated. Further¬ 
more it is clear from the figure that (a, b) will be regular and of the first kind 
if no point of intersection of the tangent l lies in PB. 

The analytic phrasing of the geometric condition can easily be given in 
terms of the function <f>(x, £) defined in (10). For from the equations (11) 
it follows that the solution of ( 1 ) which corresponds to the tangent having a 
point of contact whose x is £, is <f>(x, £). Accordingly </>(x, £) will not 
change sign for x>£, since the tangent at £ does not cut the integral curve for 
points whose x is greater than £. Moreover, <f>(x , £), regarded as a func¬ 
tion of x, has a minimum at the point x = £, by (11). Therefore the analytic 
form of the condition is the following: the interval (a, b) is regular and of 
the first ( second) kind provided that <p(x, £) is positive or zero for x > £(x< £). 

We are now in a position to develop the following test: If ( a , b) is a 
re/ular interval of the first ( second) kind for the equation (fi ) it is also a regular 
interval of the same kind for the modified equation (12), provided that the 
inequal it•/ R £ 0 (R 2 0 ) obtains. 

Proof. The proof can immediately be made by means of the auxiliary 
formula (IB) in which we take 

V = <t>(x, a), 

where a remains to be specified. This choice of 17 is possible since <£(x, a) 
is a solution of ( 6 ). We obtain then 

(17) y = 

when g is that solution of the modified equation ( 12 ) which by (15) satisfies 
the conditions 

(18) g(a) = 0 (a, a) = 0 , y'(a) = *,(«, a) = 0 , //"(a) = 0 ? ( a , a) j. 
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Accordingly y is precisely the solution of the modified equation (12) which 
plays the same rble for (12) that <£(x, a) does for the equation ( 6 ). There¬ 
fore if (a, b) is not regular and of the first kind for the modified equation 
there exists some a such that the corresponding y changes sign for x > a. Let 
x 0 be the least value of x for which this is true. Then we have the equation 

0 = *(*,, a) - o i?(f) y(f) Jf. 

But by hypothesis (a, b) is regular and of the first kind so that </>(x 0 , a ) 
and 4>(x 0 , f)(x„> £) are positive or zero. Moreover i?(£) is negative by 
hypothesis, and y(f) is positive for £ between a and x 0 by (18). Thus we 
have the sum of two positive or zero quantities equal to zero. Of these two 
quantities the second given by the definite integral must be negative; for the 
first factor <*>(Xo. {) of the integrand can only be zero for isolated values 
of x since <f>(x„ £) is a not identically zero solution in £ of the adjoint equation 
to (G), given by 

yi(*o)*i(f) + *(*)*(() + u*< r o) z $(t) i 

likewise the third factor of the integral cannot vanish except at x ( and 
£ on account of the way in which x 0 was chosen ; finally the remaining factor 
R(t-) m xy be taken to be not identically zero. Hence the above equation is 
impossible. 

The following special case of the test is important: The interval (n, b) 
i9 a regular interval oj the first ( second) kind for an equation 

(19) y'" + ?</ + (§ + i?) y = 0 

provided that R £ 0 (R £ 0) in the interval (a, b). 

In this case it is only necessary to identify the equation (G) with the self- 
adjoint equation : 

y + W'+Jjr-O. 

for which any interval (o, b) is regular and of both kinds, and to state the 
test for this special case. 
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Since any equation ( 6 ) can be written in the form (19) we have a method 
of separating any interval into a sequence of regular intervals the first of one 
kind, the next of the other, and so on, the end points of these intervals being 
given by those roots of R = 0 at which R changes sign. 

The regular intervals thus chosen have a special geometric property which 
however I shall not stop to prove: The successive osculating conics to the in¬ 
tegral curve lie one within the other at all points of such an interval , and at the 
end points the curve C and the osculating conic hgperosculate. 

6 . Intervals of Oscillation. Comparison Theorem. It remains 
to develop the nature of the dependence of solution and coefficients. Anal¬ 
ogous to the notion of the interval of oscillation as the distance between suc¬ 
cessive zeros of a solution of an equation of the second order, is the notion of 
forward and backward intervals of oscillation. 

Consider the totality of those solutions of (1) which vanish at x = a ; 
these solutions will form a two parameter linear family. Suppose that they all 
vanish again in the interval (a, b). The least interval (a, 0), where 0 is 


greater than a, which has the property that all these solutions vanishing at 
x = a will vanish again in (a, 0) is called the forward interval of oscillation 
at x = a. Likewise the least interval (0,a) where 0 is less than a, which has 
the property that all these solutions vanish again in (0, a) is called the back¬ 
ward interval of oscillation at * = a. 



It may happen that not all of the solutions which, vanish for x = a vanish 
again for x > a ; in this case a forward interval of oscillation at x = a does not 
exist. Likewise a backward interval of oscillation may not exist. 

Let us now consider a regular interval (a, b) which, as before, is taken to 
be of the first kind. The integral curve AB will then be a spiraliform curve 
with the point B in its interior (see Fig. 5). For such an interval (a, b) a 
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forward interval of oscillation never exists, unless AB is an oval or touches 
itself. Thus in the figure let 1* be any point whose x is a ; then the line PL 
corresponds to a solution which does not vanish for x > a, since PL does not 
meet PB at any second point. 

The backward interval of oscillation may however exist. Thus in the 
figure, the tangent to the curve AB at P first meets the arc ^lPat Q ; if the 
point Q corresponds to x = £, then (/3, a) is the backward interval of os¬ 
cillation at a. Any straight line through P meets QP again and this is not 
true of any shorter arc P(f . Since 4>(x t a) corresponds to the tangent at a, 
the quantity A is the greatest root of <f>(x t a) = 0 less than a. If the tangent at 
Pdoes not meet AP again, the backward interval of oscillation does not exist. 

It is apparent from this figure that any solution vanishes at most in two 
distinct points in a backward interval of oscillation (£, a) on such an interval 
(a, 6). Furthermore, if a given solution y vanishes for any value of x > a, 
it will necessarily vanish in (£, a). This is obvious from the figure also. 

Similar facts are true of forward intervals of oscillation if (a, b) is of the 
second kind. 

Comparison Theorem. If (a, b) be a regular interval of the first 
( second ) kind for the equation (fi), and i/ i?S0(/l«0), the backward 
( forward) interval of oscillation (/3,,a) [(a,/9,)] at x = a for (12) is smaller 
than the like interval (/9,a) [(a,£)] /or (6). 

Proof. To prove this theorem we shall again employ the auxiliary form¬ 
ula (16). As in a preceding proof we shall take 

rj = 4> (X, a) 

and thus obtain again (17). Suppose now if possible that the theorem is not 
true, so that we have 0 2 As we have already noted, the function analo¬ 
gous to <f>(x , a) for the modified equation (12) is precisely y of (17), which 
must then vanish for x = £, but not for < z < o, by the analytic condition 
obtained above, since (a, b) is regular for (12) also. The two functions y 
and 4>(x, a) will therefore be positive throughout (£, a) since neither vanishes 
within (£, a) and since both are positive in the vicinity of a by (11). Put 
now x = /9 in equation (17). It reduces to 

y(/S) = yV/8. f) ■«(£) .'/(£)<*{. 

where 4>(0, jj) is positive. But It(() is negative or zero. Therefore in this 
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relation are equated quantities, one positive or zero, the other actually nega¬ 
tive. This is impossible and the theorem must hold. 

We note the following result contained in the above theorem : The back¬ 
ward ( forward) interval of oscillation for the equation 

y"' + ?y + (7+ It) y = 0, 

where /? ^ 0 (7? 5 0) is smaller than that for the self-adjoint equation obtained 
by taking 11 = 0. 



Any equation (1) may be reduced to the normal form above by a known 
transformation. In the present section we have developed the dependence of the 
solutions on II, by showing that if R is negative a decrease in R decreases the 
backward interval of oscillation. In the figure (Fig. 6) we have repre¬ 
sented by a full line the integral curve C of this equation and by a broken line 
the integral curve D of the modified equation. If (&, a) represents the backward 
interval of oscillation at a for the curve C and (£,, a) the similar interval 
for D , we have > 0. For the sake of the comparison the curves 
have been taken with the same initial point and direction. It would 
now be desirable to give a comparison theorem which stated, in simple terms, 
the dependence of the interval of oscillation on the function q. If we increase 
q, however, it may be shown that the interval of oscillation may be either in¬ 
creased or diminished in length. In the self-adjoint case the dependence is 
more simple: 

Comparison Theorem. Self-adjoint case. Given two equations reduci¬ 
ble to self-adjoint Jorm, in particular to 


y"' + in/ + -f y = 0, 

( 20 ) 

y"' + qaf + \ y = 0, 
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such that q x > q* in (a, 6) ; then the backward (forward) interval of oscillation 
of the first of these equations is less than that of the second. 

Proof. In order to prove this theorem we shall prove first that the back¬ 
ward interval (/9, a) of oscillation of any self-adjoint equation 

(21) y + ^ y = o 

is precisely the interval of oscillation (0,a) for the linear differential equation 
of the second order 

(22) *" + !* = 0- 


Let y be the solution of the equation (21) such that 

(23) y(«) = y'(«) = o. y"( a ) = 1 - 

That is, y is chosen so that it is equal to <f>(x, a) by (11). As has been shown, 
the backward interval (£, a) of oscillation at x = a is given by the largest 
root of y = 0 which is less than o. 

Now we have clearly, by triple integration by parts, 


y yy"'dx = yy" - y"* + y"y - Jy'"ydx, 
and by a single integration by parts, 

J y(q</)dx =qy'-f (qx/ + q'y)ydx. 


Therefore we have 


Jy{>J" + ?.’/ + y)dx = 2 y,f - y* + qy> - f{'J" + qi/ +£ y)y dx. 

Remembering now that y is a solution of equation (21) which furthermore 
satisfies conditions (23) wo obtain 

2 yy" - y* + qy' = o. 

Make the substitution 

y = z\ y> = 2zz! t y* = 2 (zzT + zfi), 


68 



DIFFERENTIAL EQUATIONS OF THE THIRD ORDER 


123 


1911] 


in this last equation, and, on the removal of a factor 42 s it will reduce to the 
linear differential equation (22). It is clear that, as thus determined, z will 
be a solution which vanishes at x = a. And 0 will be determined of course as 
the greatest value of x less than a for which z = 0. Hence the backward 
( forward) interval of oscillation at a of the given self-adjoint equation (21) 
is given as an interval between two successive zeros , of a solution of the asso¬ 
ciated linear differential equation (22) of the second order. 

The theorem we wish to prove is now an obvious consequence of the 
Comparison Theorem stated in the introduction. 

The above reduction is not new. 

7. Oscillatory and Non-Oscillatory Solutions. The preceding 
paragraphs afford practical methods for obtaining an idea as to the distribution 
of the zeros of a solution of (1) provided the solution is known to vanish. 
The following theorem often enables one to determine whether a given solution 
is oscillatory or non-oscillatory in the interval (a, 6) i. e. whether the solution 
vanishes or not. 

Theore m. Let (a, b) be a regular interval of the first (second) kind for 
the equation (6), and suppose that R £ 0 (R £ 0). Let rj and y be solu¬ 
tions of (6) and (12) respectively , so related that 


y(o) = v (a), y'(a) = (o), y"(o) = v "(a). 

The inequality 

(24) |y| < hi 

obtains for values of x less ( greater) than a and within the backward 
( forward) interval of oscillation at x = a until a value ofx is reached for which 
y = 0 ; the inequality |y|>hl obtains for all values of x greater (less) than a 
until a value of x is reached for which rj = 0. 

Proof. If the inequality (24) fails to hold for x < a, it will certainly 
fail to hold before an x is reached for which one has 17 = 0. In fact, otherwise, 
for this x we have y = 0, by (24), and the theorem does not say anything 
about y for lesser values of x. Therefore if the 6rst part of the theorem docs 
not hold, the inequality 

(25) |y| * hi 

is true for some x < a, say x 0 , in the backward interval of oscillation (0,a), 
such that neither y nor tj vanish in (x 0 , a). 
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But y and rj have precisely the properties that they had in the auxiliary 
formula (16) so that 

y(*o) - >?(*.) = Jh**. f) R(.i) y(f) <*f- 

The loft h \nd member in this equality is of the same sign as y (r 0 ) or zero, since 
(25) holds at x = x 0 . The right hand member is of opposite sign to y(x 0 ), since 
/?(£) is negative and 4>(x 0 , £) is positive, x<, and £ being contained within the 
interval of oscillation (£, a) (see Fig. 5). This equality cannot hold and the 
first part of the theorem is true. 

The second part of the theorem is proved in an analogous way. An 
equation 

y(*o) - >»(*o) =-J *(*o. f) #(?) y(f) d f 

will be obtained if the theorem is not true, in which x 0 > a is an x for which 
|y|*|*| ™hile neither y nor 17 change sign in (a, x 0 ). In this case too /?(£) 
is negative and<£(x 0 , £) is positive. As before, the two members of this equal¬ 
ity would have opposite signs. 

The relation of the two solutions is given in the figure (Fig. 7). 



The following application is at once obvious : If rj and y are solutions 

+ y = 0 

respectively , where R ^ 0 (R £ 0), and are so related that 

y(a) = rj (a), if (a) = v'(a) t y"(a) = i?"(a), 
then \y\ < \rj\ forx < a(x > a) and |y|>|» 7 | for x > a(x < a) 'until a value 
of x is reached for which y = 0 and rj = 0 respectively. 


of the equations if" + qf + j y = 0 and of if" + qf + ( 
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In fact the interval (a, b) under consideration is regular and of either 
kind for the self-adjoint equation. 

8. An Example. It is possible in many cases to form a good idea of 
the character of the solutions by means of the preceding results. As an illus¬ 
tration we consider the equation 

y"' + / + x,j = o, 

to which the adjoint equation is 

*" + *'-** = 0 . 


We choose y|, y 2 , y 3 as the principal solutions at z = 0 : 

y.( 0 ) = o, 

'J\ (0) = 0, 

yl' (0) = l, 

y«( 0 ) = o. 

!/i (0) = 1, 

A ( 0 ) = o, 

y,(0) = i. 

•A ( 0 ) = o. 

A ( 0 ) = o. 

The adjoint solutions (see equations (3) then 

satisfy the conditions 

*. (0) = 1, 

*; ( 0 ) = o, 

*1 (0) = -1, 

(0) = 0, 

*t (0) =-l, 

A ( 0 ) = o. 

*. (0) = 0, 

*5 ( 0 ) = o, 

o 

II 

• 


Tho interval * < 0 is regular and of the first kind (test for regular in¬ 
tervals, special case) and the interval z > 0 is regular and of the second kind. 
Kach of the corresponding parts of the integral curve is therefore spiraliform 
with the point A corresponding to z = 0 in its interior. 

1‘urthermore it is clear that y,, y 3 , z„ z s , are even functions of z and that 
y„ z 2 arc odd functions, since the equation is unchanged if -a; be substituted 
for x. The integral curve must be symmetrical with respect to ic 2 = 0. Hence 
the integral curve has the form indicated in the schematic figure (Fig. 8). 

The zeros of y, and y 2 and indeed of any two members of the linear 
family c, y, + c* y 2 , will alternate for z > 0 and for z < 0 (Separation' Theo¬ 
rem for Regular Intervals). 

In order to consider the question of the distribution of the zeros of y„ 
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if, for x > 0 we note that the forward interval of oscillation will decrease with 
increase of x and always be less than that of 

y* + y = o 

which is precisely 2w (Comparison Theorem). Hence as x increases the 
successive zeros of y, and of y* come at intervals none exceeding 2ir and 
tending to zero. 

The function //, also vanishes for x = x, > 0, as a direct computation 
shows. The successive zeros of y 3 (x) will fall of course at intervals less 
than 2 t r which tend to zero as x increases. 



9. The Theorems of Liouville. The only known theorems of the 
kind we have been considering concerning the solution of the third (and 
higher) order equations appear to be those due to Liouville.* 

The type of the equation considered is 


d 

di 



d 

d* ,J 


+ Xy = 0, 


where If, L are positive in (a, b) and X is a parameter. Now when X = 0 
one finds by direct integration that 


a) 



• Liouville’a Journal, vol. 3. pp. 561-614 (1838). 
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This function does not vanish for x * a. Hence for X = 0 the interval (a, b) 
is of the first and second kind, the corresponding arc AB has the shape of 
a part of a convex oval. Suppose now that the equation is reduced to the 
form (6) and that X is made to increase. By the test for regular intervals and 
the Comparison Theorem, the interval (a, b) remains regular of the second 
kind and the forward intervals of oscillation decrease. In addition, we know 
by the theorem of §7 that if we fix y, j/, y" at * = a, the first greater value 
of x for which y = 0 will decrease with increase of X. 

The results of Liouville are still more definite, such as the following: If 
we consider a particular solution y(x) for which at x = a 

v = a ' li y = b ' Tx L Tx y = e ' a> °- 4 >°- c >°* 

all of the greater values of x for which y vanishes must decrease with increase 
of X. 

This result is equivalent to his oscillation theorem which states that there 
exists one and only one value of X, say Xo, for which y vanishes at b and not 
within (a, b), and one and but one value X„ such that y vanishes at b and once 
within (a y b) and so on. 

Liouville’s results belong to another category than those which we have 
obtained. We have given only theorems concerned with any arbitrary in¬ 
crease of the coefficients. A special kind of variation was considered by 
Liouville which is however of great importance. 

Princeton University, 

June, 1910. 
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Reprinted from Bull. Amer. Math. Soc. f May, 1911, Vol. 17, p. 392. 


(G) Professor G. D. Birkhoff : “ A direct method for tlie 
summation of developments in Lamp’s functions and of allied 
developments.” 


G. A pair of ordinary linear differential equations of the 
second order 

( p H 

L(it) =-j^ + <£(•»■)" + CM*) + M*)] H = °» 


MM = ;^+ My)'' + [My) + °> 

[^(.r>/(//) - l*(x)c(y) > 0], 

will both have solutions n^aj), i\(y) which vanish for x = m„ m v 
y = n v n t respectively for certain sets V, /x. of values of the 
parameters X, fi. These solutions give rise to the formal 
development of an arbitrary function f(x, y) of two variables, 

/(•*•> y) - E<v</*h(y), 

including as a special case developments in Lamp’s products. 
Professor Ilirkhoff treats the question whether this series repre¬ 
sents the function by the use of a contour integral 

v)G(x ’ * )H(y ’ T, ^ dvdxd ' 1 ’ 


where G and 7/ are the one-dimensional Green’s functions 
belonging to the first and second of the given equations respec¬ 
tively, under the given boundary conditions, and C and D are 
properly chosen contours in the X and ft planes respectively. 
This iutegral yields an explicit expression for the sum of any 
number of terms of the given series, and, by means of a study 
of the asymptotic character of the solution of the given equa¬ 
tions for X and /z large in absolute value, this series is shown 
to tend toward /(.r, y) t where /(.r, y) is continuous, under suit¬ 
able restrictions. The limiting value at a point of discontinuity 
is also discussed. 

The method outlined above is of decidedly greater power than 
that of integral equations which Hilbert has recently employed, 
and is to be characterized as an extension of the method of 
residues. 
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Note on the Expansion of the Greens Function. 

By 

George D. Birkhoff of Princeton (D. S. A ). 


In a recent paper in the Mathematische Annalen (Vol. 71, p. 76 — 87) 
Hilb has proved (p. 80—87) that the Green’s function G(A*; £, ) associated 

with the differential equation and boundary conditions 
d. 1 V 

^ + (^ + 0^-0, F(0)-F(l)-0, 


(A a complex parameter and l a function of the real variable x which 
has continuous first and second derivatives) may be expanded into the 
infinite series 





X)-V 


even if the function l is not restricted to be real*) The numbers A/ and 
the fonctions (f>j which appear in the series are the characteristic numbers 
and the normalized characteristic fonctions respectively, associated with 
the given equation and conditions. 

Hilb’s method consists in a consideration of the asymptotic properties 
of the characteristic numbers, the characteristic functions and the Green’s 
function, together with the use of a contour integral. 

This expansion of the Green’s function in products of characteristic 
functions is implicitly contained in a paper which I published in the 
Transactions of the American Mathematical Society over three years 
a g°' **) An ad *quate account of the methods and results of this paper 


) A slight modification of the series must be made in certain cases; see Hilb, 
ioc. cit. p. 87. If l is real, the theorems of Hilbert (G6tt. Nachr. 1904, p. 49-91 and 
f K 213 - 2 f and E Schmidt (Gottingen dissertation 1905; in particular p. 12) ensure 
the validity of the expansion provided its uniform convergence is taken for granted. 
B °“ Ddary / alu * and Expansion Problems of Ordinary Linear Differential 

f 2 T.9<r «M £“1, S ° C - V ,908) ' p - 373 -’ 95 - S “ als ° B“H- A». Math. Soc„ 

(1906 1907), p. «9. Strictly ap.ak.ng, it i. duly when I is unlimitedly differentiable 
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may be found in the Jahrbuch fiber die Fortschritte der Mathematik 1908. 
Hilb’a method does not differ markedly from my own. 

Hilb has devoted a second paper (J. f. Math. 140, p. 205—229) to a more 
general discussion of expansion theorems, which is decidedly interesting 
in that non-homogeneous equations are also treated. However, in so far 
as the homogeneous case is concerned, the results are contained in the 
cases n — 1 and n — 2 of my paper, and the method is essentially the 
same as my own. 

On learning of my priority, Hilb has very courteously made acknow¬ 
ledgement of it.*) 

In this note I wish to indicate explicitly the application of the ex¬ 
pansion theorem of my paper**) to the Green’s function. Let there be 
given a linear differential equation 

L(u) + s + + • • • + P.u + Xu - 0 


where • • •, p n are continuous functions of x, together with their deri¬ 
vatives of all orders, but are not restricted to be real, and where 1 is a 
complex parameter. Also let there be given any set of n linearly inde¬ 
pendent boundary conditions 


"'•<“>-2[««^+ft- ° 

Jm 0 


where the constants a t) and arc not restricted to be real, but where 
certain inequalities obtain, so that these boundary conditions are regular***) 
The expansion theorem referred to states that any function f of x 
which is made up of a finite number of pieces, each continuous and having 
a continuous first derivative, may be expanded in a convergent series 


6 

f f(x) v t (x)dx 

2 -»<(*) 

' f «*<(*) v t (x) dx 

a 

that the above expansion comes directly as a special case of my theorems; nevertheless 
this stronger assumption on l was not used by me in this connection, but only in 
discussing certain asymptotic series to terms of ail orders. 

•) Sitznngsberichte der Physikalisch-Medizinischen Soziet&t zu Erlangen 43 
(1911), p. 68—71; see p. 70—71. I may note in passing that in Bounitzky’s paper, 
Sur la fonction <U Green des Equations differentielles liniaires ordinaires, J. de Math. 
(6) 6 (1909), p. 66—125, appear a number of theorems also to be found in my paper. In 
particular, the notion of adjoint boundary conditions which is central in his paper is 
carefully developed by me (loc. cit. p. 876—877). 

••) Loc. cit p. 390. ••*) Loc. cit. p. 382—383. 
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where u, and v t are the characteristic functions belonging to the given 
equation and conditions, and to the adjoint equation and conditions, re¬ 
spectively.*) This series converges to the value ~ [/’(*—0) -|- f(x + 0)] at 
points x between a and b but to certaiu two fixed linear combinations 
af(a +0) + Pf(b— 0) at x = a and x = b, where the two pairs of Con¬ 
stanta a and p may be explicitly determined. 

Let us now take f to be the Green's function G(x, 5; X) of the given 
equation and conditions. In this case one has 

* 

a 

so that the above expansion may be written 

Zj w 
* 

b 

provided we normalize u, and t> ( so that f u,(i) v,(x) dx — 1. 

If n > 1 the theorem stated above shows that this series converges 
to G(x, £; X) for either a<x<b or likewise for a < £ < b since the 
Green’s function is continuous in x and Moreover the series will con¬ 
verge uniformly on account of the asymptotic form of u„ t> ( and X, **], 
and hence the expansion holds for all x and 

If m - 1 and a<x<b or a < £ < b, while x + g, the series will 
converge to G(x,£; A) since the Green's function is continuous for such values 
of x and || for z — | it will converge to }[G(£-0,$; A)+ G(£ + 0,|; A)]. 
There remain the possibilities * - { - 0 and a- - £ - 1 when it is necessary 
to consider certain additional terms***) which here reduce to zero. 

Therefore we -have: For a given differential equation L(u) + Xu =- 0 
and regular boundary conditions 1F,( U ) - 0 (*—1,2, - «) the associated 
Greens function G(x, |; A) may be expanded in the above series. Ifn>l the 
series converges uniformly to the Green's function. Ifn-1 the series converges 
to the Green's function for x + £ and to i[G(£_o,£ ; X) + G(|-f0,|; 1)] 
for s = £.f) 

*) For definition ° f “ijoiat equation and boundary condition, see p. 375 -S 77 . 

A suitable modification in the aerie, must be made if f\( x )v,(x)dx=0 ; aee p. 380. 

2 <*■ P- 383 and p. 388. ™) Loc“cit p. 394-395. 

with lie " Ca6e , Wh ‘ Ch J Hllb treaU ’ the e, l ua tion and condition, coincide 

with the given equation and conditions so that one may take . 
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NOTE ON THE EXPANSION PROBLEMS OF ORDINARY LINEAR 
DIFFERENTIAL EQUATIONS. 

By George D. Birkhoff (Cambridge, Mass.). 


E.ir.lto 4.1 tomo XXXVI (»<» .to. |fi|) 4c. rundtronil d*l Cirtolo MaU>mnUeo dl l-nUrmo. 

A4«jaaaia 4«l i| »fnl« i«l). 


Nearly five years ago I formulated certain boundary value and expansion problems 
of considerable importance, and gave a solution of these problems ')• 1 ° a paper 
which has just appeared in this journal *) Tamarkine treats the same problems to 
advantage, but casts doubt upon my work. I wish to make it plain that this doubt is 
without substantial foundation, and also to add one or two details which arc not 
without interest. 

In my paper were considered certain sets of functions formed by those solutions 
of an ordinary linear differential equation of the n '* 1 order, 

+ p.„+ ■ • • + p. w* + - * («<£*£;»). 

which furthermore satisfy n boundary conditions of the form 

«, + ••• + *.<*(«) + + ••• + ft. ,# W = °- 

Under certain restrictions I answered some central questions which arose concerning 
(t) the distribution of those characteristic values of the parameter X for which 
the differential equation has a solution satisfying the boundary conditions, 

( 2 ) the asymptotic properties of these solutions or characteristic functions, 

( 3 ) the convergence to an arbitrary function /(x) of its expansion in a linear 
series of these characteristic functions. 

The means of considering these questions were furnished in a preliminary paper 
written expressly for the purpose 3 ). 


') G. D. Birkhoff. Boundary value and expansion problems of ordinary linear differential equations 
fTransactions of the American Mathematical Society, vol. IX (19^8). pp. J 75 -J 9 SJ- 

a ) J. Tamarkise. Sur quelques points de la tbeorie des equations differentielles liniaires ordinaires et 
sur la generalisation de la serie de Fourier (Rcndiconti del Crcolo Matematico di Palermo, t. XXXIV 
(2 0 sem. 1912), pp. )4S*)82). 

3 ) G. D. Birkhoff, On the asymptotic character of the solution of certain linear differential equations 
containing a parameter (Transactions of the American Mathematical Society, vol. IX (1908), pp. 219-2JIJ. 

Get. Ualtm. nurm,. «. XXXVI (*• »«-. i 9 i j). — Sum F ««o il »6 ns.ggio If.). I 
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Basing his work on this preliminary paper, and following my method and nota¬ 
tion, Tamarkine takes up the case when the differential equation is of even order 
(n = 2f*), whereas I had treated the case of odd order (n = 2,u — i) and had only 
outlined the entirely similar treatment in the case of even order. 

The first theorem which I proved deals with the distribution of the characteristic 
values 4 ). By a method of simple and elementary character, I established this theorem 
in the case n = 2 f* — i and indicated the corresponding similar treatment possible in 
the case n = m; in this latter case it is only necessary to factor the quadratic 
expression A which equated to zero gives the characteristic values 5 ). From the purely 
formal point of view this factorization yields at once the two necessary forms of asymp¬ 
totic series for the large characteristic values, and shows that, at least if these two 
scries arc not identical, the characteristic values belonging to the two series, as well 
as the roots of the quadratic expression, are different for large values of the parameter 
X; if we equate each of the factors of A separately to zero, two equations arise which 
are amenable to exactly the same treatment as the single equation which arises in the 
case n = 2 fi — x *)• If the two asymptotic forms are identical 7 ), an easy extension 
of my method may be employed. In all cases I inferred that to each asymptotic series 
corresponds one simple zero of A, and that these can coincide in a double zero for 
large values of the parameter only if the two series are identical. 

This conclusion was embodied in the first theorem, which stated that the charac¬ 
teristic values are «in general simple » (that is, do not have more than one corresponding 
characteristic function). Of course I realized that there may be an infinite number of 
double characteristic values, a possibility exhibited indeed by the simplest known cases •). 

In the course of the proof of the same theorem for the case n = 2fi, I encoun¬ 
tered a further difficulty in a special case. After making a transformation to the p plane 
(X = p'), I was able to assert that there existed solutions of the given differential equa¬ 
tion preserving a definite asymptotic form in cither of the two sectors 


D » : 


The indicated necessary forms of the two sets of characteristic values might restrict 
one or both of these sets (taken in the p-plane) to tend asymptotically toward the ray 
common to D t and D t ; under these circumstances it was not possible to construct 


4 ) See *). PP *80-387- 
s ) See *), equation (j8). 

®) See *), equation (J4). 

7 ) This is an extremely special case which can only come about if the coefficients />, (*). t( x ) 

are subjected to an infinite number of equations of condition. 

*) For (he sense in which my statement of the theorem is to be understood see M. BOcher, 
Sounder, Pro tUm m On, Dimension [Proceedings of the International Mathematical Congress, Cam- 

bndge . 9 ,a, Sec. «.). Professor BOche, is kind enough to inform me that his references stand in their 
original form, as given last August. 
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certain circles of fixed radius lying wholly within either D t or D t , and so the proof 
mentioned above, which depends on the preservation of the form of A throughout 
such circles, broke down. 

If this were the case, l argued as follows: The values of p which correspond to 
characteristic values of > have necessarily the form 


?k = 2 kite + T# * 



(*- 


where y 0 is real and \E t \ is limited. Moreover the quantity is independent of the 
particular choice of the functions £,(*)» ...» p.(x), so that if these functions are 
replaced by p/>,(*)» ...» pP m (x) respectively, the above formula remains valid. More¬ 
over, for |p| ^ i the quantity |F 4 | remains uniformly limited. Hence, for | p | large, 
all the characteristic values which exist must lie within circles of arbitrarily small 
radius r described about the points 

1 *"‘ +r.« • 


Now let fi vary continuously from o to I. For p = o the differential equation redu¬ 
ces to 




of which solutions are known to exist that preserve a definite asymptotic form in the 
entire p-plane and accordingly in D, and D t combined. Thus the original form of 
proof establishes that for p = o one and only one of the zeros of A lies in each of 
the circles above defined. As ft varies from o to t, the zeros of A vary analytically 
with ft and cannot pass out of these circles. Accordingly these zeros must continue to 
exist for ft = i, one within each circle. I indicated this treatment of the special case 
very briefly ®). 

Later I have observed that the solutions preserving asymptotic forms in D t and D t 
which I had defined continued to have this form to any specified distance outside of 
these regions lo ) 

Both of these forms of argument are superseded if the observation of Tamarkine 
is correct, that n solutions always exist preserving a single asymptotic form in the 
region D formed by D t and D t . Unfortunately his proof of this essential fact is 
outlined so briefly that I have not been able to follow it. 

The oversight on my part in connection with this theorem to which Tamarmnb 


®) See «), p. ,87, 

lo ) See *), where a region S may be defined to be a region 

«fp(*)) ^ Hfp w, (*) + u], ..., S[ P tv.., (*)] ^ 8» fp to. (*) + tij 
( 9 i denotes ■ the real part of •) instead of as in equation (5) of that paper. Practically the Only other 
modification necessary after this change is io the fundamental inequality (7), which is to be replaced by 
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calls attention ") consists in an incorrect calculation of the asymptotic form of the 
sets of characteristic values in a special case. In this case I did not include terms in 

_ j_ _ ± 

k \k (k = i, 2 , ...) but only terms in k°, k~\ k~\ ... in a series pro¬ 

ceeding according to positive integral powers of k *, and this omission has no effect 
on any of the arguments of my paper. 

A further remark may be made concerning the nature of the characteristic values. 
As has been stated, for large |X| a zero of A is cither simple or double. In the first 
case the characteristic value is also simple. For, otherwise all of the first minors of A 
vanish to at least the first order when a takes on this value, and the determinant of 
these minors vanishes to at least the n ,h order. But this determinant is identically equal 
to A" -1 , so that A must vanish to at least the second order, contrary to hypothesis. 
In the second case the characteristic value is at most double, since not all of the second 
minors of A vanish for |X| large ,J ). In every case, then, cither the characteristic 
values are simple, or pairs of characteristic values may coincide to give a simple or 
double characteristic value. This is the simplest possible type of degeneration and is 
precisely the state of affairs that I had in mind at the time of writing my paper. 

Tamarkine states that two distinct scries of characteristic values may, one or both, 
correspond to a double value for large |X|, and signalizes the fact that I have over¬ 
looked this case ,3 ). But, as I have indicated above, this case can never arise. 

It ought to be noted that in my list of asymptotic formulas for the characteristic 
functions * 4 ) I have not provided for the extremely special case n= 2 ,u in which the two 
asymptotic scries for the characteristic values are identical, or for the special case when 

these series proceed according to the powers of Tamarkine docs not give any 
space to such formulas. 

I come now to my second theorem ,s ). The proof depends on the evaluation of 
a sequence of contour integrals 

(,) 7„yzn G (*>»: *)/«*' <*- ■.»....) 

where f(s ) is the function to be expanded in series, and C k is a contour in the com¬ 
plex X-planc. By making a transformation of the function G(x,j;X) to a suitable de¬ 
terminant form ,6 ), I tried to make it almost visibly evident to the reader that, when 
simple restrictions are placed on the function f(s) (e. g., that it be made up of a finite 


"> See *), p. 
Ia ) See »). p. 


JS8. 

* 74 . Cf. 


)• P- for a like reasoning in the case n = 2u — i. 


n = 


* 3 ) See a ), p. J7j, theorem and footnote. 

* 4 ) See '). P *89. 

* s ) See *), p. j 9 o. 

Se * 1)1 f0rmuU (s6) ’ and F- W where 1 indicate the necessary changes to be made for 
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number of pieces, each having a continuous derivative) and the closed contours C h 
form a sequence of expanding circles, this integral converges to [/(x — o)+/(x + o)], 
a < x < b y and to fixed linear combinations of f(a -f- o) and f(b — o) at x = a 
and x = b. Having given this form for G(x, j, X), I indicated each step of the proof 
without going into analytical detail that seemed to me obvious. 

Tamarkine takes over this determinant form of G and restricts himself to the 
case a < x < b. Concerning my work he says, « les raisonnements dc l’auteur, qui 
sont exposes d’ailleurs bicn britvement, ne nous paraissent pas suffisants » * 7 ). 

In order to make it clear of what character arc the details omitted in my proof, 
I shall repeat my discussion, enclosing in quotation marks that which is found in my 
paper '“*) and adding some of these details at the proper points. Let me first make 
plain the nature of the path of integration, and the nature of the function G. 

In the X-plane the integration of (i) is to be taken around a sequence of circles C k 
with centers at the origin and indefinitely increasing radii. It is necessary to determine 
the limit which the integral (i) will approach under these circumstances. We consider 
a typical part of the integral (i) in which we take X = p", and the path of integra¬ 
tion in the a-plane is the arc AB of a circle with center at the origin and lying in 
the sector 

, . Ir. „ .(/+ 0 - 

that is, we treat the integral 

(3) IS = J‘ d f£ n f"' G < x - *» f')/«*»*• 

There remain three other integrals ijy, to consider *•), whose limits 
may be obtained immediately by considerations of symmetry. 

Let xv t , ... , w m be the roots of the equation 


( 4 ) “'" + 1=0 

arranged in an order such that for the values of p in question 

(5) wCpm'.XWCp w.X ••• < w (P“' h )^o^ 91 (P%.,)< ••• <«(P“'.) 

and let the bracket notation [a] stand for a function of p having the form 


± + L 
p + 7 


where a is a constant and b is a known function of the variables x and s t and where |£| 
is uniformly limited for |p| sufficiently large and for all values of these variables. Then 


• 7 ) See *), p. ,68. 

,8 ) Cf. *), pp. ,9,-596. In one or two places a word or two is changed because I have taken 
the case n = ajx instead of n = an — 1 in order to make comparison with Tamarkine’s paper more 
easily possible. For the same reason I have uken a = o, i = t. 

*•) See '). P J95- 
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the determinant formula for G referred to above is 


(6) up—G(x, 5; p‘) = 


-'W.. 


a ,(?y 


(l = l, 2, .... »;/ — I, 2.. I), 

(< = I, 2, .... m). 

(«' = >. *.«), 

^ ™ '• 2 ." :}* = }*+ 2 . .... n). 


T - (x ' f) • • • ? w ? -aT®-- • • • ?) p) 

l’~ (x - p) • • • Vsr • • • T - (x - p) B - ( ” p) 

where the following abbreviations have been used 

f) = [*.'■;•] -f- «**>[!».«.*•] 

p)=+1?. «*■;■„] 

?„,(*. f)=[(».•{'] 

(7) U(*,s; = 

/-• 

.... 

*»,(i>>=[?.i +*^*[+,1 (?, * o, +, ^ o), 

*>,(p) = + DU (?. 5^0,+,^ o) “). 

We can now give ihe proof referred to above: « From this form of G we can 
easily determine the limit of 

. Assume the sequence of circles in the f-plane so chosen that their distance from 
the nearest points f ,„ p,„ ■•) is at least i> o. This refers to the minimum distance 
of .*«> Pm from any point of the circles. In the proof of the theorem of § 4 
Pn. Pm were found to be approximately equally spaced along a rav parallel to the 
upper bounding ray of the sector (a). The above construction of a sequence of circles 
is therefore possible. Then on AB when | P | is large we have 

( 8 ) ![?,] + f “ l ‘[Ul > K> o •*) and likewise -f- [+,]| > /f > o. 

In fact if the equality 

m+ ‘"-ru= T 

obtains, where |,| is small, we infer at once that the point p lies near .0 one of the 


lh ' ’ P ' dl1 »■ = ?«• t. — . 'h' «rtm of the form £/,* in [ f ,J and [+,.] are to 

be r eplaced ^ by E/,‘, but the late, argument remains unaltered. This necessary b„, almost trivial change 
was unnoted by me, due to my oversight in calcubtion referred to above. 8 

a ‘) These are the images in the f -plane of the characteristic values. 

”) Loc. cit. *). 

one TA,,A, " mE ' S Umml ~ P “* « * * 
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points p a , p m ». For from the last equation we have 

+«(?) 

where e(p) tends to zero as |p| becomes infinite. Taking logarithms then we find 
pu- = 2kT. |'— 


+ ,°s ? + 


where k is an integer, where the logarithm on the right is the principal logarithm, 
and where t*(p) tends to zero as |al becomes infinite. But if I9I is small this ensures 
that |p | lies near to one of the points p„, s m since the quantities p /|f p m are the 
roots of this equation for 9 = 0. We may suppose then that the inequalities (8) obtain 
along A B. 

« In view of (8), every element of h p"“’ G given by (6) is bounded on A B , 
since the exponents have negative or zero real parts [see (>)] ». It may be noted that 
(8) may be written 

KM > * > O 0 =1, a). 

« Consider those terms of I‘~ B ‘ which do not contain the first element of the last 
column ». There are a finite number of such terms each having either a form 

(?) yV-rt*. f>/'*■’*'■(• + ^ + y)/(•»>' 

or a similar form in which the exponential coefficients x or s , or both, arc replaced 
by x — 1 and s — 1. The outer exponential factor conics from the element (not the 
last) of the first row of the determinant expression (6), being contained as a factor 
in the element. The factor ^(x, p) is the bounded factor remaining in the product of 
the elements of the first 11 columns after this exponential factor has been removed. 
The inner integrand is a typical element of one of the elements (not the first) of the 
last column. « The variable s is confined to the last column. When we integrate these 
terms as to s they have the form M/i (M bounded) as an integration by parts be¬ 
tween the points of discontinuity of f(s) shows". In fact such an integration gives 




when /(j) is continuous; certain terms 


+ y£‘-'~‘‘Emds\, 


0 <*) 


arc to be added if /(*) is discontinuous at s = s. In all cases the factor in brackets 
and hence the whole expression is of the form ,\f/p (.\f bounded). «< But these terms 
all contain as factor an element of the first row, not the last, besides other bounded 
elements ». This refers to the factor 

?) 


84 



«Sincc these first row terms are small for |p| large, except the p ,b one when p is 
near to B , we conclude that the p-integration of them will yield only terms which 
become indefinitely small as |p| becomes indefinitely large ». In this connection wc 
observe that by (j) 

AC |pl« (*>o; I—i f .... p-i), 

so that as stated for i < pi ^ 

0)1 <e 

where e is arbitrarily small; hence the integral is less iu absolute value than 

L 

and tends toward zero as |p| increases indefinitely. Also, for i = p. we have by ( 5 ) 

so long as <«>•) 

0 °) ?<Jarg,^£±J 2 «_| -) (»>.) 

no mailer hove small .5 is chosen. Hence in this case the integral is less in absolute 
value than 

where B , denotes the point of the arc AB in which it is met by the upper ray of 
the sector (io). Thus the integral is less in absolute value than 

2-tM -f MX 

and must approach zero as |p| increases indefinitely. 

« Accordingly wc need only consider the terms which contain the first element 
of the last column », havmg proved that all other terms approach zero. « This element 
had a coefficient i as it had originally in G(v, ,, >.) Thus ye luve 

rs -zh-X , '£tr .(—■+‘ , r ) + ?)/«»<+■. 

where «, signifies a quantity that tend* to aero as |s| increases. Here again we may 
restrtet ourselves to the terms that do not contain ,/s .. For the second of the two 
terms 

. 

obviously tends to aero as |p| becomes infinite; and inasmuch as we have 

|0/(0*| < ( ,< A 

|07(0*1 < M, ^ argp^CLhii' _ 

If—(, 0/(0*| < (e±i>_« _ * ^ , rKf ^"(/ +0.) 

’ 4) Along the upper ray of the sector (a), vanishes. 
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I 


the first term is less ia absolute value than 




for i < f i, and is less in absolute value than 


< *./*■*". 


for i = Hence these terms approach zero as | p | becomes infinite. 
« Finally therefore 


(ir +•- 


where e, signifies a quantity that tends to zero as |p| increases ». 
« Integrating by parts we find 


<**> f-= /(I ~ 0) + (!=.,». K ) 

r P 

where Cj is certainly small unless * = p and p is near to B ». More precisely, we have 

= /£ x -~ 0) - - -JjjM i>, 

at least when /(x) is continuous; otherwise, terms of the form 




* — o) — fCi + o 


('" < *) 


must be added. Hence the value of is 


- *'*"/(<>) + l'[/(‘ - °) -/<* + O)]^- 9 - K>)ds. 


For i < p f this obviously tends to zero uniformly on AB as |p| becomes infinite; 
each term being of a form before discussed. For { = (* and p restricted as in (10), 
we see that the same is true, while on the small remaining part of AB the term re¬ 
mains finite. Summarizing, we see that we can take 


(or > < p, 

(.3) for i = (i f 

.,<Af, for i = 

where c and S are arbitrarily small, a From this we conclude that 


Miw4. On. Maim. Mm, i. XXXVI (a® l(B . — Suaaaio U at 
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where c tends to zero, since we have 


< 2*e 


kkjTM 

Kl< / \ J r\ + f!'\^r\< 2,ct+M ' t 

tt and therefore conclude that 


(«< *>. 


(i = !*)• 


°) 


+ *, 


where e, tends to zero as |p| increases without limit#. 

By symmetry we evaluate the limit of the other partial integrals and thus find 
that (i) tends toward j[/(x — o) +/(x -f o)], completing the demonstration. 

There is a single other matter to which I wish to call attention. I stated, with 
only a hint of the method of proof, the results for the end points x = o or x = i 
as well as for o < x < i, while Tamarkine docs not discuss the end points. The 
reader who looks over the preceding argument will find that for x = i all of the 
preceding argument remains unaltered so far as the evaluation of the term arising from 
the first element of the last column in ( 6 ) is concerned. In the set of terms first 
considered the argument is somewhat modified because now the exponential factor 
appearing in the first row terms after the pt' h 

reduces to i, and is not infinitesimal on A B. It is thus necessary to consider terms 
of the form [see ( 9 )] 

(,4) (- + ^ + £)/«'*. 
or the form 

(,;) iT7==7 + ^ + f )/(,»,. 

Let us consider the first of these. The factor ? (i, p) is the product of elements in the 
first n columns of the determinant ( 1 ). The integration as tor is conducted precisely 
as before in the terms ( 9 ) and gives 




where « 4 is restricted essentially as s, was in ( 15 ). Since ? (i, p) is limited, the term 
of the integral containing «, tends to zero as |p| becomes infinite, and thus we need 
to consider only the limit of the term 
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II 


as | p | increases. But the factor 9 ( 1 , p) has obviously the form 

' + f + f 


Since we have 
if p lies in the sector ( 10 ), we see that 


«.(pK(p) 

lim = lim = o 




= ?, +. 5* °). 


where e ? tends to zero as |p | increases indefinitely. Therefore 

The first integral on the right hand side may be written 


and the second integral is clearly less in absolute value than M t 6. Hence the term ( 14 ) 
approaches a constant multiple of /(o). Likewise each term of type ( 15 ) approaches 
a constant multiple of /( 1 ). This establishes the statement of the theorem for x = 1 , 
and by symmetry for x = o. 

An easy reckoning gives us for the value of ijj" at x = 1 


Dr A ~;=DU = 

O ... O I ... I j/( 1) 

«X P.X-• • • PX (p.^X'V^ 


• • • «X* &X-. • • ■ P.X 

where D stands for the minor of the last element in the first column of the deter¬ 
minant. The value of Tc at x = x by a like reckoning found to be 

(■ 7 ) /£' = V 

where U' is obtained from U by the interchange of and Hence the series 

converges to U U' at x = 1 . 

To obtain the formula for the sum of the series at x = o we need only observe 
that the transformation x=i—x interchanges the rdle of o and 1 , and at the same 
time of fl ( and a. (1 = 1 , 2, ..., n). Hence the series converges to VV' at x = o, 
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where V and V are obtained from U and U' by interchanging p ( , P.,/(i) with 

Entirely similar formulas exist in' the case of n = 2 ja — 1 •*). 

Cambridge, Maw., March 10, 191}. 


George D. Birkhofp. 


**> In my paper *) I put the emphasis upon the somewhat simpler case of odd order (»—au—«\ 
Nevertheless formally the boundary problems of odd order can always be reduced .0 problems of even 
order e. g. the problem 

reduce, ,0 '+»« = ■>, .(o) = „(,X 


tf+\u = o, u(o) = u(l), «/(o)= ./(I); 
more generally the order may be multiplied by any integer. 
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Reprinted from the Proceeding! of the Natiowal Acad cm y of Sciences. 
Vol. 3. pp. 6Sb-dS9. November, 1917 


A THEOREM ON SERIES OF ORTHOGONAL FUNCTIONS WITH 
AN APPLICATION TO STURM-LIOUVILLE SERIES 

By George D. Birkhoff 

DEPARTMENT OF MATHEMATICS. HARVARD UNIVERSITY 
CommunicAted by E. H. Moore. October 10. 1917 

1. The Theorem .— An infinite set of continuous functions u x (x), « a (x), 
... is closed on the interval 0 g x £ 1 if there exists no continuous 
function f(x) not identically zero for which J %l 0 f(x)u m (x)dx vanishes for 
all n; the set is normalized if /Jk* (x)dx - 1 for all n; it is orthogonal if 
f'o (*) (x)dx - 0 for m 4= n. Most of the series of mathematical 
physics are linear in closed normalized orthogonal sets of functions. 

Theorem. If t*i(x), «*(*)> • • • form a closed normalized orthogonal set 
of functions, and if U\(x), u 7 (x), . . . form a second normalized orthogonal 
set such that 


2 («.(*)“*.(*)) “.O’) (0S*,ygl) 

»- 1 

converges to a function E (x, y) less than 1 in numerical magnitude in such 
wise that the series multiplied through by an arbitrary continuous function 
f(x) can be integrated term by term as to x and yields a uniformly conver¬ 
gent series, then the set ui(x), tij(x), ... is closed also. 

Proof. If the set zii(x), u*(x), ... is not closed there exists an / 
not identically zero such that J'of(x)u m (x)dx vanishes for all n. In 
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this event, if we multiply through the equation of definition for H by 
/to and integrate as we may by hypothesis, we find 

X'/(*) H (*-y> **=2 /.'/(*) (*) <**•«. (y), 

mm | 

where the series on the right-hand side will converge uniformly by 
hypothesis and so represents a continuous function. 

This series on the right has the value/(y). In fact the difference 

• 

- 2 X'/w “.(*)<**•«. 6) 

m - I 

is a continuous function „(y) such that fW(y)u.(y)dy vanishes for all 
n, precisely because the set u,(x), u,(x), ... is normalized orthogonal. 
But the set u t (x), u-.(x), ... is closed by hypothesis; hence we infer that 
vanishes identically. Hence the right-hand member of the preceding 
equation has the value/(y). That equation may now be written 

J 0 '/(x)B(x,y)dx=/(y). 

If the maximum numerical value of f(x) for 0 £x § 1 is F *0,and if 
this value is taken on for y - y 0 , we get 

I X'/ to n (*» >o) dx\-F. 

But this is impossible since by hypothesis | // | < 1 and | / | ^ F. 

The set u x (x), iii(x), ... is therefore closed also. 

2. An Application. The Sturm-Liouville series, in a specialized but 
typical form, arises from the set of functions Wi(x), u t (x), . which 
are the solutions of a linear differential equation of the second order 

+ ( x - g(x) ) u « o 

satisfying boundary conditions «(0) = u(l) - 0, for the ordered set of 
parameter values X„ X,, . . of X respectively. We will assume that 
g(x) and dg{x)/dx are real and continuous. 

By means of the methods of Sturm it is proved that u t (x), u 2 (x), 
may be taken to form a normalized orthogonal set in which u ’(*) will 
vanish precisely n times within the interval 0SiS| And the meth 
ods of Liouville give the asymptotic form of M** will be convenient 
for us to use the formula 

u n = V2 [ sin nrx + + M n (x,g(: r)) l 

L n ^ “ J 
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in which M„ is a continuous functional of x and g(x), bounded as long as 
g(x) and dg(x)/dx are bounded uniformly for all n. 

For the satisfactory investigation of the representation of an arbitrary 
function in Sturm-Liouville series it is necessary to know further that 
the set Mi(x), w 3 (x), . . . thus obtained is closed. A very simple proof 
and the first is due to Stekloff. 1 His proof uses, however, the theory of 
functions of a complex variable. There is indeed a special case, namely 
the case when g(x) vanishes identically and u m (x) - y/l sin mrx, 
when a short proof is possible by an elementary method. 

The theorem proved above makes possible an immediate demonstra¬ 
tion that the set is closed, once the asymptotic formula above and the 
fact for the special case are granted. Let us replace g(x) by ag(x) where 
a is a real parameter varying from 0 to 1. If then wi(x), tt 3 (x), . . . 
denote the members of the set for any particular a and u x {x) } iis(x), . . . 
denote the members for another value of <x, say a, it will suffice to show 
that the series for H will have the properties demanded in the theorem 
for | a — a | g b > 0. For in this event since the set is closed for a - 0 
(the special case), it will be closed for a ^ b by an application of the theo¬ 
rem; by successive further applications of the theorem it is shown in the 
same way that the set is closed for a £ 26, a £ 36, . . . , so that finally 
we infer that the theorem is true for a - 1, i.e. for the given set. 

In virtue of the explicit formula for u m (x) we have 

Um ( x ) - u n (x) - y/2 T ~ ^ cos nvx + M n(x t og(x))-M n (x~ag{x)) 

In n 3 

u,(y)-V2 [sin n,y + £«£5"!2 + 

Multiplying these two expressions on the right together we obtain the 
typical term of the H series. This series breaks up at once into six 
(2 X 3) other series all of which converge uniformly to a small value 
for |a — a\ small save the series obtained from the leading terms in 
both expressions, namely 



If we omit the small factor (<r — «r) <p (x), this series may be written 


sin nr (x + y) - sin «r (x - y) 
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Now the sum of an> number of terms of a series 2 (sin n*z)/n is known 
to remain uniformly bounded, 1 and the series is known to converge every¬ 
where, uniformly save in the immediate vicinity of the values z = 0, 
* 2», =*= 4 jt, etc. Hence the series displayed above converges uni¬ 
formly for all x, y save in the immediate vicinity of * = y, where, how¬ 
ever, the sum of any number of terms of the series is bounded. Hence 
all six types of series will converge to values small numerically for | o-o | 
small, and, when multiplied through by any continuous function may be 
integrated term by term as to x, yielding uniformly convergent series in 
y. Thus the H series will have the stated properties for | <r - a | g { > 0. 

3. Generalization. The theorem suggests at once a theorem in Gen¬ 
eral Analysis as defined by E. H. Moore.* If we employ a quasi-geomet- 
ncal terminology this generalization may be stated as follows: any set of 
orthogonal vectors in a functional space lying near enough to a complete 
set of orthogonal vectors in that space is itself compute. Another still 
wider generalization suggests itself: any set of vectors in a functional space 
lying near enough to a compleU set of vectors admitting a reciprocal set is 
itself complete and admits a reciprocal set.' This second generalization 

evidently plays the same part in relation to biorthogonal sets that the 
first does for orthogonal sets. 


*5* ? i“ ks0n ’ Rtnd ‘ Cin ' Matm Pat *mo, 32, 1911, (257-262). 

See Bull. Amer. Math. Soc., New York, 18, 1911-1912, (334-362). 

(942 «5). ,yPe 5 ” *" * r ' iCl ' °' “*«• **"'■ CML ACM. Set, 161, 1917, 
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(Reprinted from Toe American Mathematical Monthly, Vol. XXVUI, May, 1921.| 


AN ELEMENTARY TREATMENT OF FOURIER’S SERIES. 


By GEORGE D. BIRKHOFF, 1 Harvard University. 


The aim of this note is to treat the “remainder” after n + 1 terras of the 
Fourier’s series for a given function fix): 

%a 0 + (ai cos x + &i sin x) + (a* cos 2x + sin 2x) • • •, 

i r* w i r- 

a* = ~J fix) cos lexdx, b k = -J f(x) sin kxdx. 

It will be assumed that the function/(x) is periodic of period 2ir and continuous 
together with its first three derivatives. 

Let T n (x) stand for the sum of the first n -f 1 terms of the above series; its 
derivative 7Y(x) is seen at once not to exceed in numerical value 

(|ai| + |&i|) + 2(|aa| + |6 2 1) 4* ••• + n(|a«| 4* |M)- 
Furthermore, by integration by parts and use of the periodicity of fix), we find 

a* = — J fix) sin kxdx = — j* fix) cos kxdx 

-7i?£'f'" Msinkxdx - 

Consequently \a k \, and similarly | b k |, is less than 2F i /k 3 where F z is the maximum 
of |/"'(*) |. We conclude that | 7V(x) | is not greater than 


4 /,, ( f ’ + i + •' + i )<^>( 1 + ( 1 -|)+ - + (^ 1 -;)). 


1 Professor Birkhoff’s earlier paper in this Monthly, “Note on certain quadratic number 
systems for which factorization is unique,” appeared almost exactly fifteen years ago. His first 
published paper, in collaboration with H. S. Vandiver, “On the integral divisors of a" — b",” 
was published in Annals of Mathematics, 1904. His second paper appeared also in Annals . . 
1905. His third paper was a thirty page memoir in Transactions of the American Mathematical 
Society, 1906; and his fourth paper is the one referred to above. A complete collection of his 
mathematical papers 1904-1919,37 in number, is preserved in a bound volume at the mathematical 
seminary' of Brown University.— Editor. 
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so that 

(1) \T n '(x)\ <8F 3 . 

The expression T n (x) has also the properties 


( 2 ) 



j* T n (x) cos kxdx, 
T m (x) sin kxdx, 


for k ^ n, as follows by direct integration. Using the equations of definition 
of a k , bu in combination with (2) wc deduce at once 


(3) j cos kxdx = j* H,(x) sin kxdx = 0, (k m n), 

where R n (x) denotes the “remainder ”/(*) — T n (x). 

Now by means of the familiar product formulas of trigonometry we may 
express successively cos* x, cos x sin x, sin* x, cos* x, • • •, sin” x as linear combina¬ 
tions of 1, cos x, sin x, cos 2x, sin 2x, • • •, sin nx. Therefore wc find from (3) 

(3') Rn(x) cos - x sin* xdx = 0, (a+0< w), 

and, noting the equation 


cos2 “ 2 ~ r ° = 5(1 + cos x cos x 0 + sin x sin x 0 ), 
we infer from (30 

(4) £'«.(*) cos-= 0 

for any x 0 . For convenience we make the change of variables x = x 0 -f- t and 
obtain 

OT f + t) cos u £dt = 0. 

The limits can be made =fc * since the integrand is periodic of period 2 t r in /. 

By the aid of (40 we propose to obtain an upper limit for the “remainder” 
Rn(x o). We have 

I Kn(xo + t) - Rn(x o) I == \f(x 0 + t) - f(x 0 ) | + | T n (x 0 + t) - T n {x o) I 
where Fi stands for the maximum of |f(*)| and SF> is at least as great as 
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I Tn(x ) I by (1). Now the ratio 

y 

sin y 


(M-l) 


is positive and may be seen geometrically or otherwise to have its maximum 
value 7r/2 for y = ± r/2. Therefore it is clear that 


= t 


. t 

“5 


( 1*1 £*•>. 


whence we find 

(5) |ft.(*. + 0 - ft-M S N lain }(|. W = »(* + 8F,)). 

Thus we may substitute in (4') 

(5') ft.(xo + 0 - «.(*•) + »^ sin 

where |0| % 1. When we do so there results 


( 6 ) 


*.(*«) = - 


r: 


ON cos*" | sin ^ dl 


r-i* 


The numerator on the right in (6) is not greater numerically than 


Ar£ cos*-1 
Also the familiar formulas 


. t 
3m 2 


<f/ = 2iV jf cos’" ^ sin ^ (if 


4AT 


2n + 1 


x 


»/j 


cos" xdx 


show that 


1 -3 • * * n — 1 t 
2*4 • • • » 2 

2-4 *»n-l 
3-5 ••• n 


. (neven), 
(n odd), 


r /» /•»/* 

cos" xdz • J cos’* 1 xix 


2(n + 1) 

Of the two integrals on the left the first is of course the greater, and it follows that 

•»/» 


and thus that 


I C0S ’ xdx> 'I^r ij- (n = 0, 

X * 9 t C wn I 2* 

, co3 ’'2 dt = 4 i co, * , * fa > 2 \sr+r- 

If we replace the numerator and denominator in (6) by the greater and lesser 
values respectively which have now been found, the fraction is increased numer- 
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ically; whence finally _ 

(7) |fl.Wl < W,(2iH- 1) 

for every value x 0 of x and for every n^O. Thus the remainder approaches 0 
as n increases indefinitely, and the Fourier's series converges to/(.r). 

The method outlined above may easily be modified so as to deal with a more 
general type of function/( jt), for instance any periodic function with two deriva¬ 
tives. Essentially the only novel feature of the method is the direct use of an 
equation like (4) to obtain an upper limit for the remainder. 1 

‘ See Lebesguc, Lefons sur Us series IrigononUlriques, Paris, 1906, pp. 37-38, and de la 
Valltfe Poussin, BulUlins de VAcadlmU royaU de Belgique, classe des sciences, 1908, pp. 193-254, 
in particular p. 230 el seq. 
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Circular Plates of Variable Thickness . By Geoiiok D. 
Birkhoff, Ph.jD. t Professor of Mathematics , Harvard 
University. 

A S far as I am aware, only plates and shells of constant 
thickness have been considered in the theory of elas¬ 
ticity. The aim of the present note is to develop a method 
applicable when the thickness is variable f. The method is 
here applied to thin circular plates, nearl}' plane, and clamped 
at the outer edge. 

Case I .— The Incomplete Symmetrical Plate under 
Radial Pressure. 

Suppose that ordinary cylindrical coordinates (fig. 1) 
are adopted, with the axis of z along the axis of tho plate. 


Fig. 1. 



Tho plato is of innor radius a and outer radius by and is 
symmetrical about the plane 2=0. Let a be proportional 
to the thickness of the plate, so that z=ta and 2 =— ta may 
be taken as the equations of the upper and lower bases 
respectively. Here t is a small parameter, siuco the plate is 
thin. 

Radial forces are applied to the inner edge, so that points 

t I am greatly indebted to Mr. Carl A. Garabedinn, of Harvard 
University, for able assistance in carrying through some of the calcu¬ 
lations and for verifying those I have made. Mr. Garabedian is under¬ 
taking the consideration of more general problems by this method. 
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for which r=a undergo a displacement of amount e. The 
plate is clamped at the outer edge. Thus, if U and w denote 
the radial and axial displacements, the boundary conditions 
are 

U(a, 0) = e, U(i,0) = 0, 

*, (6 ,0)=^M=0. 

or 

Obviously such a plate is unstable, with a tendency to 
buckle. 

It is a cardinal fact of the theory of elasticity that the 
actual displacement of the plate will be such as to yield the 
minimum potential energy consistent with the constraints 
iinposod (Love, ‘Theory of Elasticity,’ third edition, p. 169). 

I5y Love (pp. 99-100, 141) this potential energy W is 
given as follows :— 


It is natural to introduce a new variable 2 *, such that * = tz f . 
When we replace z by z' and afterward suppress the accents 
there results :— 

+tt r/iau au-y_ usu 4ua<« 4aua«rn . . 

+ ■aba-- + ) ~ 4 7 r ar ~, av b-\j rtl:llr - 

Our assumption will be that all of the quantities involved 
can be expanded in ascending pow ers of t —in particular that 
U=U 0 + U|l + Uft 2 +., 

tD=w 0 +w l t + w a t*+ . 


It is to be noted that if in (A) we replace z by — r and 
w b X or t by -t and to by — u», the double integral is 
altered at most in sign. Since these transformations do 
not disturb our boundary conditions, and since W has a 
unique minimum, the special relations 

(B) U„,(r, z) U„«(r, — z) = w m (r 9 z) + w„(r, — c)=0, 

m=0,1,2,..., 

(B') U 2m+l (r, z) = w 2ut (r, z) = 0, in =0, 1, 2,., 

must obtain. 

The case of a plane plate shows that the energy is of 
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order t. Hence if W is to be a minimum, it is clear that we 
must make the leading term of W, namely 

H HR ?!'** 

vanish if possible. But this quantity vanishes if, and only 
if, U n =U 0 (r), and by thus restricting U 0 the stated boundary 
conditions are not violated. 

When U 0 is thus restricted, the double integral W will 
involve no negative powers of t , and will have as leading 
term :— 

'0^< v M U -' + T + 87]' 

where accents denote differentiation with respect to r. The 
part of the integrand in braces may be written as the sum 
of squares:— 

We turn next to the choice of ^7 , which is an arbitrary 

function of r and z still at our disposal. If wo call this 
variable x, the above expression involves x in two terms of 
tho form 

\(m + x)* + 2 f u: i . 

Elementary calculation shows that for ,v= ^ l0 ^ ore " 

going expression has the minimum value ro*. Hence 

we must take + ^ 


dlCl _ —X / yy t U 0 \ 

aT-xT^l Uo+ r)’ 


or 


147,= 


X + 2/a 
-X c 


(U.'+ y°)+ -<('), 


X + 2/a 

s(r) arbitrary. But tc(r, 0)=0 by (B) ; lienee s(r)= 0. 
This choice of ic, does not interfere with the boundary con¬ 
ditions. 

The terms written above now reduce to give 

*-r r. am + '‘ > ( u -” + v> xu -' ""} * j - 
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and an integration with regard to z can be explicitly per¬ 
formed. The principal part of W takes the form YVV, where 

Wl =xT^f {(* + M)(U 0 ” + 5£) + XU 0 ' } ar dr. (1) 

This integral is to be made a minimum subject to the boun¬ 
dary conditions 

U 0 (a) = e, V 0 (b) = 0. 

Accordingly our problem is reduced to a simple problem in 
the Calculus of Variations. The condition $W, = 0 gives 
at once 

d d<l> 

rfr§u;'-Wo =0, . (2) 

where & is the integrand in (1). Writing out this equation 
in full, we obtain 

& [ 2 (*• 4 /*) u o'+ x y]*r — [2(X+/*) + \U„']« - 0. 

Consequently U 0 must satisfy the following differential 
equation :— 

■ ,3, 

where 2h = 2ta stands for tho thickness of the plate. 

In the case of a plate of constant thickness, h is constant 
and h = 0, and (3) reduces to a well-known form (Love, 
p. 141). 

Thus far wo have determined the displacements to be 

U = U 0 (r) + U/ , +.,.( 4 ) 

“ = “x?^( u# ' + v 0 ) <+,f ‘* ,+ —> • ( 5 

where it is to be remembered that the accent on z has been 
suppressed. 

We proceed to determine U„ from the fact that the body 
forces F r , F- must vanish. We have (Love, p. 141) : 

- pI '„ (X +W (lT + £' - ”)+ + 1 W =0, 

.(C) 

p ‘ t* d-' a + t bc( u + 

K 19»o h 2 w\ 

r^ + 5^) = °- • < 7 ) 
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Substituting (4) and (5) in (6) and (7), the terms in I"*mid 
t' 1 vanish. From the constant term in (6) we obtain 

B’U, 3\ + 4 M / „ U 0 ' U„\ 


whence, using (B), 


Also, the constant term in (7) vanishes. 

Furthermore, the surface tractions must vanish on the 
froe surfaces c = a, z = —ol ; lienee (Love, p. 76; the following 
equations must hold on these surfaces :— 


X,= X x cos (a-, v) + X, cos (y, r) + X.. cos (z y v) = 0,1 
Y„= Y,cos (*, v) + Yj,cos (y, v) + Y*cos (s, v) = 0, 
Z„ = Z x cos (ar, v) + Z y cos (y, v) + Z- cos (r, v) =0. 



If we consider a tangent plane at an arbitrary point of the 
surface z=a , the direction of the normal is denoted by v, 
and (8) gives the tractions across the tangent plane in terms 
of the stress-components across planes parallel to the rectan¬ 
gular coordinate planes. For cylindrical coordinates, wo 
have (Love, pp. 100, 141) :— 

X,=XA + 2 / i*„«XA + 2 / *|£, 

\ y = XA + 2/*#»yy=XA -f 2^ ~ , 

Z- = XA + 2/^x- = XA + 2/t^g-7 , 

Y z— Zy = /^jfr = 0, 

Z'= x -=^-'=Kr^ + dT> 

X y =Y,=/ie J y=0, 

where 


To determine the direction cosines, wo return to our 
original variables and observe that tan 6 — ta, 
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Hence we find along the x axis 
cos (*, V) = , cos (y, ,) = 0, cos r) = 

Fig. 2. 



Hence the conditions that the tractions vanish on the free 
surfaces become 


-<a'X x + X, = 0,.(9) 

-to! Z* + Z,=0.(10) 


As we proceed to higher order terms, using (6) und (7), 
the relations (9) and (10) may be expected to play an im¬ 
portant r61e in furnishing differential equations to determine 
the arbitrary functions that enter. In the present case there 
is no constant term in either left-hand member; and by 
virtue of (3) the terms in t also vanish. 

Thus the surplus body forces per unit of volume vanish 
up to order t f and the surplus surface traction? per unit of 
area vanish up to order P. Hence the total surplus applied 
force is of the order t *, whereas the total given radial force is 
of order t. It seems clear that when the surplus applied 
forces are removed, no sensible change occurs in the radial 
or axial displacement, aud that we have a solution of our 
problem. 

It remains to compute the principal part of the radial 
pressure on the inner edge of the plate. Here we make use 
of baint-Venant s principle and obtain the resultant traction, 
X,(r, z)> as 


fT la ’ X x (a, 0) a,lr ,10 = Slra P(*+n)/‘(a) 
Jo J —ta(a) 


[ 


Uo'(a) + 


2(XH-ix) a 


Uo(a) j 


If we desire to take t = l throughout, so that the equations 
of the bases ore z=a and r=-«, then t disappears as an 
explicit parameter, but the terms of the solution are still 
ordered according to the powers of the natural parameter 
namely the ratio of the thickness of the plate to its diameter ’ 
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Case II .— 'Ihe Complete Plate under Axial Pressure. 

As a second example consider a thin circular plate, nearly 
plane, damned at ine outer edge and subject to an axial 
force P. Here a small force P yields a relatively largo 
displacement, and our method is put to a more delicate test. 
Again adopt cylindrical coordinates (fig. 3). 

Fig. 3. 


i 



i 


The plate is not restricted to be symmetrical. Accordingly 
let a and be proportional to the distances of the bases of 
the plate from the plane z=0, so that z = ta and z = tfi aro 
the equations of the lower and upper bases respectively. 

The boundary conditions are the following :— 

U(0,0) = 0, u>(0,0) = e, 

w(a ,o)=^-2)=o. 

Again, wo take the formula for potential energy us the 
point of departure. Tho term in t~ l is now 

"jo’ J>+ 2/t) [%T + } r d: dr - 

This term can only be made to vanish by setting 
w 0 =w 0 (r), U 0 =U 0 (r), 


and the boundary conditions are not thereby violated. 

The constant term in W disappears and we can writo tho 
integrand of the term in t as the sum of squares :— 






In the present case we are led to attempt to make this inte¬ 
grand vanish, since tho case of an ordinary plane plate shows 
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that the energy will be of order t 3 , Thus we must take 

U 0 = 0, ifli = ujj(r), U,= - zw 0 ' + p(r), 

where p(r) is arbitrary. None of these requirements violate 
the boundary conditions. 

The physical significance of the conditions thus far obtained 
is immediate ; the displacement is transverse, and such that 
filaments perpendicular to z=0 remain perpendicular in the 
displaced posilion and are not compressed transversely. 

When these conditions are imposed, it is easy to sec that 
the energy is of the order t 3 . More precisely, the leading 
term in W now becomes 

-ff {(^w[-.W + ^ + =^ + <g’]' 

The part of the integrand in braces may be written as 

x[-z U , u U P ^ + =™^ + *g] , 

+ 2 M [(-*V'+pT+ £(-*«„' +p)*+ (£■)’] 

Since U 2 appears only in the last term, and since W is to 
bo a minimum, we choose 

U 2 = —zwi' + q(r). 


Wo proceed next to the choice of and find (by the 

samo method as employed in the first example) 



= x+fc(-*< + J' , + 




< r ) arbitrary. It may be noted that the choice of U„, 
docs not interfere with the boundary conditions. 
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The integral written above thus becomes 

"tCX {xrfc [( x+2 '*> {(-w+P')*+ p '+py } 

+ M-ciOo" + p') ( ~ Z ’°i + - P )] } rdrit, 

and an integration with regard to z can be explicitly per¬ 
formed. To this end it is convenient to introduce a new 
variable z\ where 


z = m + z\ = m 4- A, a = m — h; 

at the same time we replace p by v , where 

hv= — mwQ 1 +Pi 

so that hvt is the radial displacement for the mean surface 
z = m. With this notation the principal part Wj£ a of W has 
a coefficient. 




+ ("'+ \ " + ?”•')’ +3 
«•■)]}*■*• ■ • <»» 


h' m 


It is to be observed that -j L are independent of the 
thickness of the plate. 

The integral is to be made a minimum subject to the 
boundary conditions 


«'o(0)=e, u> 0 (a) = 


0, W(a) = 0;| 

„(0) = v(a) = 0.j* * ' K } 


The form of W shows that w 0 '(0) vanishes also; otherwise the 
integral would be infinite. 

Our problem is now one in the Calculus of Variations, and 
the condition $W,=0 gives us 


dr'dwo" 9u> 0 ' ’ dr"dV Qv ’ 


. (13) 


where 4> is the integrand in (11). We will not write out 
these equations. The system of two equations is of the fifth 
order. The five arbitrary constants in the general solution 
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are determined by (12). One of these is an additive con¬ 
stant in w 0t since w 0 does not occur explicitly. 

For the case of a symmetrical plate we have m' = 0, and 
the variables separate. The equations assume the simpler 
form :— 

i [»•{*<*+<•) T+’fc'}]-* 1 ' 

+-*» ("■ + J -) 

+ 2(X + /*) ^ v' 4 2 J v ) +0. 

If, furthermore, the plate is of constant thickness, the 
equations in w 0 ' and t* become the same :— 

r V' + rv' — v =0, r*w 0 "' + ru> 0 '' — w 0 ‘ = 0. 

This differential equation for w 0 coincides with that obtained 
in the usual theory (Love, p. 494). 

When w 0 and v have been determined in the general cas<*, 
the displacements are given by the formulas :— 

U= — ( z—m)v}ft + hvt+ .. 

"= Wo+w,t + xTv [s i w °" + T )“* {,7r + hv ) 

It seems possible to proceed with tnis example, as in the 
earlier case, by considering the body forces and surface 
tractions, and determining terms in U and tv of higher order. 

The problem of the thin circular plate under special con¬ 
ditions has thus been formally treated by means of the 
following method, apparently applicable to thin plates, shells, 
and beams :—(1) introduction of a small parameter t to 
represent distance from a fundamental mean surface or line ; 

(2) expansion of the Lagrangian function T —U, the dis¬ 
placements, the body forces, and the surface tractions as power 
series in t ; (3) determination of the early coefficients so as 
to make this integral a minimum of as high order as possible 
in t ; and (4) determination of later coefficients so that the 
body forces and surface tractions have the required values. 
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NATURAL ISOPERI METRIC CONDITIONS IN TIIE CALCULUS 

OF VARIATIONS 

By G. D. Birkhoff and M. R. Hestenes 
Department of Mathematics, Harvard University 
Communicated January 12. 1935 

1. Introduction. -In the present note we formulate the following 
principle: 1 

Every extremal of a positively regular integral J satisfying a prescribed 
set of end conditions affords a minimum to J, relative to neighboring admissible 
arcs satisfying these end conditions and a suitably chosen set of isoperimetric 
conditions which luive the further property that every extremal of J satisfying 
the prescribed end conditions also satisfies these (natural) isoperimetric 
conditions. 

Thus we see that under this principle every extremal can be basically 
characterized by the minimum number of natural isoperimetric conditions 
which are needed to reduce it to one of minimum type. 

The chief purpose of this note is to define such “natural isoperimetric 
conditions” for the fixed end-point problem in the calculus of variations 
and to give some illustrations of its numerous applications. The results 
described in this paper can be extended readily to variable end-point 
problems and to problems concerned with closed extremals, reversible or 
irreversible. 

()ne of the special features of the use of natural isoperimetric condi¬ 
tions is that it enables one to obtain easily and naturally the usual oscilla¬ 
tion, separation and comparison theorems by classical methods of the 
Calculus of Variations instead of by the use of the accessory boundary 
value problems, as has been done heretofore. In fact, the use of artificial 
orthogonality conditions becomes unnecessary since these arose only be¬ 
cause the natural isoperimetric conditions had not been used. 

2. Natural Isoperimetric Conditions. —Consider a class of arcs joining 

two fixed points 1 and 2 in (x‘. x") space on which the integral 
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J - //(, i)dt 

is well defined. Let H{x, x) be a function satisfying the conditions 
H(x, kx) = //(x, x)(* > 0), //(x,, y) = //(x t , 2 ) 
for all (y) 4 1 (0) and (z) 4= 0, and let v>'(x, x) be a solution of the equations 

ft* = 

The condition 

K * £ {/*'*' + (//x* - = 0 

will be called a mi/wra/ isoperimetric condition. It is satisfied by every 
extremal of 7 joining the points 1 and 2. A simple and important case is 
the one in which II = *>'(x)/;'; in this case K takes the form 




0 . 


( 1 ) 


We have the immediate theorems: 

Theorem 1. An admissible arc joining the points 1 and 2 which has 
continuous first and second derivatives and satisfies all natural isoperimetric 
conditions is a solution of the Euler equations. 

Theorem 2. Let E„ be an extremal arc (not intersecting itself) whose end 
points are not conjugate and which satisfies the strengthened conditions of 
Weierstrass and Legendre. Let m be the sum of the orders of the conjugate 
points of the point 1 on E„. Then there exists a set ofm (and no fewer) natural 
isoperimetric conditions of the form ( 1 ) such that £„ affords a proper strong 
relative minimum to J in the class of arcs joining its end-points and satisfying 
these m conditions. 

Thus we see that every extremal of the integral J can be considered 
as one ot minimum type in a suitably chosen natural isoperimetric problem. 

-1. The Accessory Problem and the Accessory Boundary Value Problem.— 
It is not likely that there exist always a set of natural isoperimetric condi¬ 
tions such that every isoperimetric extremal satisfying the end conditions 
is an extremal of the integral J. Nevertheless these do exist in certain non- 
parametric problems, such as the accessory problem associated with the 
second variation of an integral J in non-parametric form. Here we study 
an integral of the form y 

= f xx 2 Vt v ') dx 

where 2a, is a quadratic form in and and the functions ,,(*) vanish 
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at X\ and x 2 . The natural isoperimetric condition analogous to (1) is of the 
form 

Mt. k) = J” |r<«* + = 0 (2) 

where fr(x) vanishes at x x and x t . If there are m conjugate points of x x 
between x x and x 2 and if the strengthened Legendre condition holds, then 
by the adjunction of m (and no fewer) suitable natural isoperimetric con¬ 
ditions the arc ( ij ) = (0) becomes a minimizing arc relative to these condi¬ 
tions. This result is true even if the point x 2 is conjugate to x, and is 
obtained by the use of classical methods in the Calculus of Variations. 
Thus we can count the number of conjugate points of Xj between x x and x 2 
by counting the minimum number of isoperimetric conditions necessary 
to reduce the arc (>?) s (0) to a minimizing arc. 

Heretofore in the study of the accessory boundary value problem 

(d/dx)w n .' - <*>,. + or), = 0 t?,(xi) = i 7 i(x 2 ) = 0 

it has been a common procedure to adjoin isoperimetric conditions of an 
artificial type so as to make the arc ( rj) s (0) a minimizing arc. The num¬ 
ber of conditions needed is then equal to the number of negative charac¬ 
teristic roots a < 0, each counted a number of times equal to its multi¬ 
plicity. These artificial conditions can be replaced by the natural iso¬ 
perimetric conditions (2) and so we obtain a new proof of the following 
result due in the general case to Morse: 

Theorem 3. The number of characteristic roots o £ 0 is equal to the sum 
of the orders of the conjugate points of x, on x x x 2 . 

4. Oscillation, Separation and Comparison Theorems.— Since natural 
isoperimetric conditions afford a method of counting the number of conju¬ 
gate points on an interval it is clear that they lead to oscillation, separa¬ 
tion and comparison theorems as well. A typical oscillation theorem of 
apparently new type is the following: 

Theorem 4. There exist (at least) m conjugate points of x, between x x 
and x 2 if and only if there exist a set of m admissible variations m a M (<* = 
1 , . . ., m) of the original problem vanishing at x x and x 2 for which 

Mv aO = Mva, Ve)*a*0 < 0 M = 1. • • *1 »0 

for every set of constants (x) =£ ( 0 ) where 7 s (f, ij) is the integral ( 2 ). 

The key point of the proof of this theorem is that J 2 (v a . v) = 0 are a 
set of natural isoperimetric conditions. 

5. Problems Admitting a Group of Automorphisms. —Frequently an 
extremal E x2 has associated with it a group of automorphisms; in such a 
case there is in general a least sub-arc E n of E x2 with the property that every 
point on E x2 is the image of a point on E u under this group. If we compare 
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the value of the integral J on Eu with neighboring arcs under the same 
group and satisfying the prescribed end conditions, then it is clear that the 
number of natural isoperimetric conditions needed to reduce the arc £12 
to one of minimum type in this class depends only on the nature of the sub¬ 
arc En and the group. In many cases if there are / conjugate sub-arcs the 
indices and number of natural isoperimetric conditions for a fundamental 
sub-arc at most equal those for the whole arc (regardless of the group) 
divided by l. 

For example in the restricted problem of three bodies (see the next 
section), the group is given by (x, y) —► (=*=x, *y), and / is 2. 

5. An Application of Natural Isoperimetric Conditions .—Natural iso¬ 
perimetric conditions not only give rise to a simple characterization of 
extremals but also give us information concerning their location. A good 
example of this is afforded by the restricted problem of three bodies dis¬ 
cussed by G. W. Hill. Here we are concerned with the closed extremals 
of the integral 

j =f c \*y'-yx' + (2tl - «)'*(*'» + /•)*)* 

where 2Q = 3x 5 + 2(x 2 -f y t )~ >/t . A natural isoperimetric condition 
in this case is the condition 

K -J' i (yO,-xQ y )(2U-c)- l/ '(x'= + y'xp'dt = 0 

as one readily verifies. It is clear that on every closed extremal E the 
expression 

yttx - xil y = 3 xy 

must change sign. Hence every closed extremal whatsoever must cross 
one Of the coordinate axes. 

* See a paper by Birkhoff Sur U probUme reslreint des trois corps about to appear in the 
Pisa AnnoU, in which the possibility of natural isoperimetric conditions is alluded to. 
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NATURAL ISOPERIMETRIC CONDITIONS IN THE CALCULUS OF 

VARIATIONS 

By G. D. Birkhoff and M. R. Hestenes 
Introduction 

In the ordinary problems of the Calculus of Variations, an extremal is defined 
in general as an arc along which the first variation of the prescribed integral J 
vanishes. Such an arc is completely characterized by the fact that it satisfies 
the attached Euler differential equations and transversality conditions. 

Among such extremal arcs there are certain ones which furnish a minimum 
value of J under the prescribed boundary conditions. It has always been a 
matter of primary theoretical interest in the Calculus of Variations to determine 
when such a minimum is realized. In the case of a minimum the minimizing 
arc can be found by use of a minimizing principle. 

For extremal arcs not furnishing a minimum the attached quadratic accessory 
minimum problem no longer is analogous to a positive definite quadratic form. 
It has long been recognized that the number of negative terms, or ‘type number’, 
in this normalized form is equal to the number of negative characteristic values 
in the associated linear self-adjoint boundary value problem. This fact has 
been especially apparent ever since the formulation of the theory of homo¬ 
geneous linear integral equations with real symmetric kernels. It is implicitly 
involved in the work of Birkhoff 1 without regard to the boundary value prob¬ 
lem, in particular for the case of type number zero (minimum) and type number 
one (minimax). Morse* has subsequently developed the notion of type number 
systematically, also using the method of broken extremals. 

The question arises as to whether or not all extremals whatsoever may not 
be obtained as a solution of a properly formulated minimum problem. In the 
present paper we show that this is indeed the case. More definitely, we show 
that by adding a suitable set of ‘natural isoperimetric conditions’, which are 
automatically satisfied by every possible extremal fidfilling the given conditions, 
there is obtained a related isoperimetric problem for which the extremal arc in 
question is a minimizing arc.* 

So far as we know, the only case in which such natural isoperimetric condi- 

Received March 25, 1935. 

1 See his paper Dynamical systems with two degrees of freedom, Trans. Amer. Math. Soc., 
vol. 18 (1917), in particular, pp. 239-257. 

* For references to the works of Morse, see his book The Calculus of Variations in the 
Large, Colloquium Publications, American Mathematical Society, vol. 18 (1934). Unless 
otherwise expressly stated, all references to Morse in the following pages are to his book. 

* See our paper in the Proceedings of the National Academy of Sciences, February, 
1935, with the same title, in which this general principle is first formulated. 
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tions have been used similarly is in a paper by Poineard 4 in which be proves 
the existence of a closed geodesic on any closed convex analytic surface. It is 
obvious that such a geodesic is not of minimum length. However, by adjoining 
the condition that admissible curves divide the surface in question into two 
parts of equal integral curvature, he proved that there exists a shortest curve 
of this type which is the desired closed geodesic. The ‘natural isoperimetric 
condition’ here entering is automatically fulfilled along all closed geodesics and 
may be reduced to one of the kind which we consider. It may be noted, inci¬ 
dentally, that the type number of such a minimizing geodesic is one, correspond¬ 
ing to the fact that only a single condition has been imposed. 

In the present paper we define first such natural isoperimetric conditions as 
seem to be of the most useful type. Conditions of more general type are 
described later. Furthermore, we have not endeavored to treat the difficult 
question of the existence of natural isoperimetric conditions valid in the large. 

With our approach the type number appears naturally as the minimum num¬ 
ber of such natural isoperimetric conditions which suffice to reduce tin* given 
problem to a minimizing problem. It is then proved subsequently that the 
definition so obtained agrees with that referred to above. At least in the case 
of one dimension this attack renders unnecessary all reference to the accessory 
boundary value problem. It has the further advantage of remaining wholly 
in the classical domain of the Calculus of Variations and of avoiding the use 
of the method of broken extremals. 

It might be expected that the isoperimetric problem arising on the adjunction 
of such conditions would require the usual isoperimetric theory for its solution. 
It is shown here that this is not the case, so that the theory can be made to 
depend upon the ordinary theory of the minimum whether in parametric or 
non-parametric form. 


In (he proof of the minimizing properly here given an interesting lemma in 
the ordinary Caleulus of Variations is established; namely, that if the ordinary 
conditions for a minimum in the parametric ease with fixed end points is satis¬ 
fied along an extremal are PQ, then, even if all arc* PQ in the neighborhood of 
any segment of the yarn extremal, no matter how far extended, he admitted, the 
arc PQ will still furnish the absolute minimum in the extended neighborhood 
A further interesting result is that in the variable end (mint ease a non-degen¬ 
erate extremal arc E„ can be reduced to a minimizing are by a suitably chosen 
set of natural isoperimetric conditions even if its end points are conjugate 
Naturally enough our attack upon the general problem is made by considering 
first the allied accessory problem involving certain accessory natural isoperi¬ 
metric conditions. In this manner it is possible to obtain the oscillation 
and comparison theorems of Morse ... more complete form, as well as a new 

snond to" 1°°™ !1‘ §6 ' f U T '* rcmark " 1 »'“< •« of these results corre¬ 
spond to rather evident facts about ordinary quadratic forms and from this 

the latter part of SK}.’” ^ ‘ <,905 >' I ’" «• however, sec 
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point of view can be predicted heuristically. As an instance of this parallelism 
we cite the following. If we vary the coefficient of a homogeneous quadratic 
form from one of positive definite type into another not of positive definite 
type, the associated linear system of equations for the characteristic values will 
vary likewise and k negative roots will be introduced if the final type number 
is k. A simple illustration of such an application in the one-dimensional case is 
obtained by allowing the interval to expand continuously, in which case a 
characteristic value ap|>ears with each successive conjugate point. 

As is demonstrated in the present paper, the theory of natural isopcrimetric 
conditions extends to problems In higher dimensions. A typical two-dimen¬ 
sional case is considered; but in order to treat it we have used the accessory 
boundary value problem; and we have limited ourselves principally to this 
accessory problem. It is to be hoped that ultimately the use of the accessory 
boundary value problem can be avoided, so that sufficient as well as necessary 
conditions for a strong minimum can be independently obtained, at least in 
simple cases. If this were accomplished, it would be possible to apply the 
Calculus of Variations directly to the discussion of the solutions of linear self- 
adjoint differential systems under self-adjoint boundary conditions. 

For the case of higher dimensions, it is obvious that the use of natural iso- 
perimetric conditions is more satisfactory than that of broken extremals which 
have no simple analogue in the case of multiple integrals. 

I 

The accessory problem in the fixed end point case 5 

In §§1-6 we shall study the integral 

J(*l) = f' 2w{x,ii,ri')dx 

in (x, 171 , • • • , i?*)-space subject to the end conditions 

v .(*») = 0, n<(**) = 0 (i ■ 1, • • • , n), 

where* 

2w = P, k (x)wt + 2Q*Cr)n,i|* + Ru(*)iWk (*» k • * *» n ) • 

This problem arises in the study of the second variation for the fixed end point 
case and is usually referred to as the accessory problem. A complete discussion 
of this problem is essential for the study of the more complicated problems 
discussed in this paper. The principal result obtained in these sections is that 
of showing that if the strengthened condition of Legendre holds, then the mini- 

4 The various parts of this paper need not be read consecutively. If one is interested 
only in the fixed end point case, it is sufficient to read parts I and III. These parts give 
the basic principles underlying our work. 

• Here and elsewhere a repeated index denotes summation with respect to that index. 
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mum number of natural isoperimetric conditions needed to reduce the arc 
( t ]) s (0) to a minimizing arc is equal to the sum of the orders of the conjugate 
points of X\ between x x and x 2 . 


1 . Preliminary definitions and theorems. A function /(z,, • • • , x n ) will 
be said to be of class C" if it is continuous and possesses continuous derivatives 
of the first n orders. We shall assume that the functions P ik = P k < are con¬ 
tinuous and that the functions Q, kf /?,* = R ki are of class C' on the interval 
X\X\. We admit functions »j,(x) which are continuous and are composed of a 
finite number of segments of class C'. Such functions will be said to be of 
class D'. A set of functions *,(*) of class D' which vanish at x, and x 2 will 
be called a set of admissible variations for the fixed end point problem here 
considered. 

The Euler equations of the integral J(r)) are the equations 
( 1 - 1 ) LM - - (d/dx)*y = 0 (t = 1 , • • • , n), 

and are commonly called the accessory or Jacobi equations. A solution t?,(x) 
of these equations of class C" will be called an accessory extremal. From the 
theory of linear differential equations of the second order it follows that if the 
determinant | R ik (x) | is different from zero on z,x,, then an extremal is uniquely 
determined by the values of *, or of v% , f. - at a single point x„. 

A value x, * x, will be said to be conjugate to x, if there exists an accessory 
extremal Vi having i>,(x,) = « 7 ,(x 3 ) - 0 and („) (0) on x,x 3 . By the order 

of x 3 as a conjugate point of x, will be meant the maximum number of linearly 
independent accessory extremals having iy,(xi) = 17 ,(x 3 ) = 0 . 

It is well-known that if the determinant | R xk | is different from zero, 
then there exists a set of 2 n linearly independent accessory extremals rj, .(x) 
(« - l f • ” , 2 ») and that every accessory extremal is expressible linearly with 
constant coefficients in terms of these 2n extremals. Moreover, the points x 3 
conjugate to x x are the zeros x a ^ x, of the determinant 

!*<•(*) I 


It follows that if the point x, is not conjugate to x„ then every pair of points 
( f«» w* can bc j° ined b y a un «que extremal arc. Furthermore, it is 

clear that there exists a set of n linearly independent accessory extremals 
Vik } X) l >r‘ • satisfying the conditions i,.>(x,) = 0 and that every 

extremal * having n.(x,) = 0 is expressible linearly with constant coefficients in 
terms of these n extremals. The conjugate points x 3 of x, are then also deter¬ 
mined by the zeros of the determinant D(x) = | tj u (x) | 

It should be noted also that if the determinant | /?„ | is different from zero, 
and ,.(*) is a solut.on of equations (1.1) of class D’ such that the functions 
f, are continuous on XiX*, then rjjx) is necessarily an accessory extremal. 
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2. Natural isoperimetric conditions. Let fi(z) be an admissible variation 
of class C" and let 

(2.1) Q(L v) = 


The condition 


J(t> i> 


f"' n({, v) dz = o 


will be called a natural isoperimetric condition for the functional J(v)- It 
satisfied by every accessory extremal m, since in this case we have by integration 
by parts with the help of equation (1.1) 

J({, „) = ["(d/dz)U,», : ) dz = *’ = 0. 

J* I 


Consider now a set of m natural isoperimetric conditions 

(2.2) 0 (or « 1, ... ,ro). 

The Euler equations of J(v) subject to these conditions are the equations 

(2.3) Li(rt + m.*.) =0, n'a = 0 (a - 1, • • • , m). 

A solution vi, Ha of class C" of these equations will be called an isoperimetric 
extremal. 

The set of conditions (2.2) will be called a proper set of natural isoperimetric 
conditions if the determinant 

| J(ta, 6) I 

is different from zero. Concerning such sets we have the following interesting 
theorem: 

Theorem 2.1. If the set (2.2) forms a proper set of natural isoperimetric con - 
ditions, then the multipliers Ha belonging to an isoperimetric extremal i u, Ha satis¬ 
fying the conditions (2.2) are all zero. 

For let Vi, Ha be an isoperimetric extremal satisfying the conditions (2.2). 
By the use of equations (2.3) and integration by parts it is found that 

0 = J(U v) = [ 1 liiaLfa) + (d/dx)U,.<*>,;)I dx 

J*i 


= ^ dx = - J ^ a ’ 

It follows that Hf> = 0 and the theorem is established. 

We have further 

Theorem 2.2. If n.o(x) is an admissible variation such that the equation 
(2.4) i) = 0 
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is true for every set of admissible variations 17 , of class C" satisfying the conditions 
( 2 . 2 ), then there exists a set of constants n f , c, such that the equations 


(2.5) 




/; 


dx + a 


hold on hit with 


Vi = V .0 + 


For Jet vi,(x) (v — l f • • • , m + 1) be a set of m + 1 admissible variations 
of class C". If the hypotheses of the theorem are satisfied, then it is clear that 
the two matrices 


f(vo, vf) 

J&, V ,) 


II Jlu, v.) II 


must have the same rank for every set of w + 1 variations m,. It follows that 
there exists a set of constants /i« such that 


J(vo, v) + v) « J(vo + v) - 0 

for every admissible arc > 7 , of class C". In fact this equation must hold for 
every admissible arc since every such arc is a limit curve of some sequence of 
admissible arcs of class C". The theorem now follows from the fundamental 
lemma in the Calculus of Variations. 

Corollary 1. Suppose that the determinant |/?,*| is different from zero on 
XiXt and the set (2.2) forms a proper set of natural isoperimetric conditions for J{rj). 
Let vM he a set of admissible variations satisfying the conditions (2.2). If the 
equation (2.4) holds for every admissible arc > 7 , of class C" satisfying the condi¬ 
tions ( 2 . 2 ), then rt i0 is an accessory extremal. 

For, according to the remark at the end of the last section, the non-vanishing 
of the determinant | ft,, | implies that the arc is an extremal arc, 

since it satisfies the conditions (2.5). The corollary now follows from The- 
orem 2 . 1 . 

Corollary 2. If the determinant | /?.* | is different from zero on x x x h then an 
admissible arc > 7 , 0 ( 1 ) which satisfies all natural isoperimetric conditions is neces¬ 
sarily an accessory extremal. 

This result follows from Corollary 1 since in this case the condition (2.4) 
must hold for all admissible variations > 7 , = $,-(*) of class C". 


3. Minimal sets. A set of natural isoperimetric conditions 

(3,1) *) = 0 (a = 1 , • • • , m) 

will be called a minimal set, if the inequality J(,) ^ 0 is true for every set of 
admissible variations 17 , satisfying the conditions ( 3 . 1 ) and if no proper sub¬ 
set of these conditions has this property. One readily verifies that if the set 
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(3.1) is a minimal set, so also is the set J(io, v) = 0, where l, a = £*/t 0 * and 
\A m ,\ * 0, the A's being constants. 

Theorem 3.1. If the set (3.1) is a minimal set, then the inequality 

J(l., tthoa 9 £ 0 (a, 0 - 1, ••• ,m) 

is true for every set of constants (a) ^ (0). 

For we may suppose that the set (3.1) has been chosen so that 

(3.2) J(U U ) =0 (* * « • 

Suppose now one of the numbers {«), say •/($-, £-»)» were positive. Let 
17 ,( 1 ) be an admissible variation satisfying the first m — 1 conditions (3.1), 
having v) * 0 and J(n) < 0 , and choose a constant 6 such that 

J(U v) * bJ(U *.) - fc/tt-) • 

The arc »n - would then satisfy the conditions (3.1) and have 

j( v - l>i m ) = •/(*) - 26 J($„, n) + 6V(U> 

= J(q) - W(|J < 0 , 

contrary to our assumption that the set (3.1) is a minimal set. This proves 
Theorem 3.1. 

Corollary. If the set (3.1) forms a proper minimal set, then the inequality 

(3.3) •/(*., <0 (a, 0 = 1, • • • , m) 

is fri/c for every set of constants (a) ^ (0). Conversely, if the set (3.1) forms a 
maximal set of isoperimetric conditions such that the condition (3.3) holds, then 
the set (3.1) is a proper minimal set. 

The following theorem shows the importance of proper minimal sets. 
Theorem 3.2. If the set (3.1) forms a minimal set, there exists a proper 
minimal set composed of the same number of natural isoperimetric conditions. 

For suppose the condition (3.2) holds and suppose J(im, £«) = 0- Let 
be an admissible arc satisfying the first m - 1 conditions (3.1) and having 
J(l) < 0. The set 

(3.4) J(S, v) = 0, </(«., n) = 0 (a - 1, • • • , m - 1) 

can now be shown to be a minimal set except for the fact that may not be 
of class C". For if there were an admissible arc n, satisfying these conditions 
and having J(q) < 0 , then by choosing a constant 6 such that 

J(S~, v) = 0 

it would be found that the arc q, - fc*. would satisfy the conditions (3.1) and 
would have 

•/(, - 6{) = ■K.’I ) - 2 W({, 1 ) + 6V({) = JM + 6V(t) < 0. 
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This is impossible. It remains to show that none of the conditions (3.4) can 
be dropped. Suppose the last one could be dropped and let ij. be an admissible 
arc having/(*) < 0 and satisfying the conditions (3.1) except for the (m - 1) st. 
By choosing a constant 6 so as to satisfy the equations 

AS, V) = bJ(i, *.) 

it would be found that the arc n, - hfwould have J < 0 and would satisfy 
the equations (3.4) except for the last. The set (3.4) is accordingly a minimal 
set. By a repetition of this process we can eliminate all the numbers J(S„, f«) 
which are zero and obtain thereby a proper minimal set as described in the 
theorem, since the functions as obtained which are not of class C" can be 
replaced by functions of class C". 

In view of this result we may restrict ourselves to proper minimal sets when 
questions concerning minimal sets arise. 

Theorem 3.3. The number of conditions in a minimal set is the same for 
every such set. 

For if the sets 

v) = 0 (a = 1, • • • , W|), 

*) - 0 0> = l, ••• , «!,) 

form two proper minimal sets and m t were greater than m h then there would 
exist constants a g not all zero such that 

= 0 . 

The arc m = VifiOf would then satisfy the first of these sets and have 

= ■/(>?* *7>)a-Kz, <0 (Q, 7 = 1, ... , ,„,) 

by the corollary of Theorem 3.1. But this is impossible. Hence w, < Wl| 
and the theorem is immediate. 

The number of conditions in a minimal set will Im* called the type number of 
the integral J. If no minimal set exists for ./, then we shall say its type num¬ 
ber is infinite. In view of Theorem 3.3 it is clear that the type number of ./ 
is a well defined concept. 

The following theorem gives a second method of evaluating the tvpe num¬ 
ber of J. 

Theorem 3.4. Let (0 = 1, ••• , m) be a set of admissible variations 
satisfying the condition 

( 3>5 ) vs)a 0 a 3 <0 (a, d = 1, • • • , ///) 

for every set of constants (a) * (0). The number of variations in a maximal set 
of variations of this type is equal to the type number of 

I Ins follows because if rj,j forms a maximal set of variations satisfying the 
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condition (3.5), then we can replace these functions by admissible variations 
of class C" satisfying the same condition. The conditions 

v) = 0 09 = 1 , ... , m) 

would then form a minimal set by the corollary to Theorem 3.1. 

Corollary. If the condition (3.5) holds for a set of admissible variations 
Vis (0 = 1, • • • , m), the type number of J is at least m. 

The condition of Legendre will be said to hold on x,x 3 if the inequality 

(3.6) Rik(x)wiw k £ 0 (t, k m 1, ... , n) 

is true for every set of constants (») ^ (0). If the condition (3.6) holds with 
the equality sign excluded, the strengthened condition of Legendre will be said 
to hold on x x xi. 

Theorem 3.5. If the type number m of J is finite, the condition of Legendre 
holds on XiXt. Moreover, if the determinant \ R a | is different from zero on x x x t , 
the strengthened condition of Legendre holds. 

For, suppose there exists a point x* on x t xt and a set of constants (jr) such 
that R tk *{Wk < 0. The inequality 

(3.7) 2«( j , pw, w) < 0 

accordingly holds at x = x t for p = 0 and hence also on an interval x'x" for 
all values of p which arc in absolute value less than a small constant p 0 . Divide 
the interval x'x” into m (m arbitrary) successive intervals h„ (a = 1, • • • , m) 
of equal length and denote the center and the length of h a by t a , 21 respectively. 
Choose the constant 6 as the smaller of po and / and set 

-(*-*.+ 6)»< ft. - a £ * £ to), 

Vto = (-1 + to + 6)w< ft.^l^. + 6), 

and Via = 0 elsewhere. From the inequality (3.7) it follows readily that 

J(rio) = J(lo, Vo) = f 2u(x, Vo, v’o) dx < 0 . 

Moreover for a * 0 the expression Q(i? a , vs) is identically zero and hence 
J{vo, va) = 0. It follows from Theorem 3.4 that in this case the type number 
of J is at least m and hence infinite since m is arbitrary. This proves the 
theorem. 

We shall show in §5 below that if the strengthened condition of Legendre 
holds, the tyjK* number m of J is finite and is equal to the sum of the orders 
of the conjugate points of i\ between X\ and jj. 

4. Two lemmas. In this section wc shall establish two lemmas which will 
be useful in proving the existence of a minimal set. For this purpose we shall 
assume that the strengthened condition of Legendre holds. 
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It is well known that for two accessory extremals u, the expression 
(4.1) l&u'i - 

is a constant, where the arguments in ay., are respectively ij„ 17 ' and u„ u'. 
This fact follows at once from the relations 

u) = fi(u, 17 ), Sl(r,, u) = i 7 iL,(u) + (d/dx){ruw u -). 

If the constant (4.1) is zero, the extremals are said to be conjugate to each 
other. A set of linearly independent mutually conjugate extremals is said to 
form a conjugate system. It is clear that a set of n linearly independent accessory 
extremals 17 .*(i) having 17 .*(*i) = 0 forms a conjugate system. 

Lemma 4.1. If an admissible arc 17, and a conjugate system t)u are such that 
the matrices 

(42) II n* *11. II Va|| (f,* - 1, ... ,n) 

have the same rank at each point on x x x 7t then the functions r ,,■(*) are expressible 
in the form 

(4 3) 17 i - HkQkto. 

The functions a t (x) are of class D' except possibly at the zeros of the determinant 

D (z) - I Vik(x) |. 

At a zero x 3 of D(x) the functions a, are continuous and the functions r, tl a[ have 
unique right and left hand limits. 

To prove this, »ve note that for values of x at which the determinant D(i) 
is different from zero the solutions a,(r ) of equations (4.3) are of the form 

o, = D,/D, 

where the functions D,(x) are obtained from D by replacing by Consider 
now a value x, at whieh D(x,) has rank n - r. We may suppose that the 
functions r, it have been chosen so that the first r columns of D(x,) are com¬ 
posed of zeros. Moreover, we may assume that ,,(r.) = 0 since this can be 
brought about by adding to 17 , a suitable linear combination of the extremals 
in view of thc fact that the matrices (4.2) have the same rank. By the 
law of the mean with integral remainder we see that 

* 7 . = (* - x,)1V„ v„ = (x - x,)ir„ 0* = 1, • • • , r), 

where W t (x) t JF«(r) are continuous in x and have 

,r< (*») - »*(**) = 

It follows that in a neighborhood of x = x 3 we have 

D = (x - x 3 )'A(x), D, = (x - x 3 y* t (x) (k = 1, ... , „), 
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where A, A k are continuous in x and A(x,) ^ 0, as will be shown below. If 
we set 

a*(x 3 ) = A 4 (xj)/A(x 3 ), 

it is clear that the functions a*(x) will be continuous at x = x*. Moreover 
from the equations 

= v'itQk + Vikd' k 

it follows that the expressions »jaO* have unique right and left hand limits at 
x = x 3 , as was to be proved. 

It remains to show that the determinant 

A(*a) = I *?!/ 17 .* I 0’ ■ 1» ••• , r; h « r + 1, •. • , n) 

is different from zero. If A(x 3 ) were zero, there would exist constants a h 6 * 
not all zero such that 

at x = x 3 . Since the extremals 17 > = = 17 ^ 6 * are mutually conjugate 

and »7,(xj) = 0, the expression (4.1) would reduce at x = x 3 to the form 

0 = Ra,l',Uk = RikUiUk. 

This is possible only in case 6* = 0 by virtue of the Legendre condition. But in 
this case we would also have 17 .( 1 *) — = 0 and hence a, = 0, since the 

solutions iUi are linearly independent. It follows that A(x,) ^ 0, and Lemma 
4.1 is established. 

Consider now the conjugate system »7,*(x) and let 

Ui = VikdkW, Vi = ri ua' k (x), 

where the functions a*(x) are of class C' and a k are their derivatives. If we 
denote by primes differentiation with respect to x when a k , a' k are considered 
as constants, we have the following immediate relations 

= v ' ik <* k , v \ = 17 \ k a ' k , 

(«,;)' = Ui - t\«„j = 0, 

(d/dx)w -s = («„;)' + + w,;.. 

If now we set 

nix) = u t (x), 17 -(x) = u'(x) + Vt(x) = drji/dx, 
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it is readily seen that 7 

2u»(i7, 17 ') = 2 o(u, u' + v) 

= 2c c(u, u') + 2 v,w m . + R tk v tVk 

= + w.w.j + 2t>,w.j + R ikV ,t>* 

" + w .;) + (“< + + Ri*iVk 

= Wdx)(mu m 0 + R, k v l v k . 

It follows that if 7 , is an admissible variation, we have 
( 4 - 4 ) JW = I R*kV % v k dx £0. 

The equality holds only in ease the functions are identically zero, that is, 
on y in case o t - 0 and „ ls an extremal arc. The relation (4.4) holds even 
if thefunctions a. are not of class C'. It is sufficient that the functions «. 
have the proper.,es descnbed in Umma 4.1, as one readily verifies. Hence we 
have the following obvious consequence: 

Lemma 4.2 Suppose the strengthened condition of Legendre holds on x,x,. 
ZZJ. “ “ SU f. admu " ile "nation* such that the matrices (4.2) have the same 

an JeZy eZrelZ" ^ * °’ '» “ 

We haveTe 8 S? ^ " d * *"* * lh ' 

Corollary. Suppose the strengthened condition of Legendre holds on x,x, 
Ifthere are no conjugate points of x, on x,x„ every accessory extremal u, affords a 
mtmmum to J relative to arcs of class D' joining its end points “ 

J 0t n “ r v J ° inS ‘ h ? end P° ints o! 'he extremal «, then we have 

hL m ~ '<’• ■» - •'<- »■ “ - - - 

J(g - u) = J(n) - 2J(n, U) + J( u ) = /(,) _ j (u) g 0 > 
as was to be proved. 

LemII at‘T ° l “IT 81 Se,S ‘ We begin " ith lhe blowing 
Lemma 5.1. Suppose the strengthened condition of Legendre holds and let x, 

o, l'ari^ti'oa^ n proce^dfiigs C of a the^Internat^onaf\i r ^h S ^ 0 r,n * a * l | ,, r' 0 ^ “ 

(1924), pp. 589-603. See?JBIta SEES of^T™ PVT’ T ° r °" t0 ' vol ‘ 1 
American Journal of Mathematics, vol. 52 ( 1930 ) Va,ia " on *- 

by Bliss. 1 pp * 738-9 - further references are given 

Trans. Amer. Math. S^TolV ll 9 ^y Um ° f LagT<Xn9e With0Ut of normalcy, 
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be a conjugate point of z, of order r. For every interval x'x" sufficiently small with 
Xj in its interior there exists a set of r admissible variations £„(z) j = 1 , ... , r) 
of class C" having = 0 on the interval x"zj and 

(5.1) */({*. £,) 0*0/ <0 (A, j n 1, • • • , r) 

/or eyery se/ o/ constants (a) ^ (0). Moreover, on the interval X\x' the arc £ t/ 
defines an accessory extremal and at x = x 3 Me matrix 


(5.2) 


II *«* (./ II (i, k = 1, • • • , n;/ = 1, ... , r) 


Aas rank n, where jj,* m a sef o/ n linearly independent accessory extremals having 

V,k(x ,) = 0 . 

In order to obtain this result we suppose that the system ij,* has been chosen 
so that >?„(xj) = 00 = 1 » • • • . r ) and suppose that the interval x'x" has 
been chosen so that there are no pairs of conjugate points on x'x". Let u ti 
be a set of r accessory extremals having 


i<„(x') = i»*(x'), Uu(z") - 0 , 


and let fc, be identical with 17 ,> on X|X' f identical with u,, on x'x", and identically 
zero on x"xj. The functions have all the properties described in the lemma 
except that of being of class C". In order to prove this, let a, be a set of r con¬ 
stants not all zero and let = {</ a,. If we set 17 , = 17 */ a, on Xii* and 17 , a 0 
on XjXj, then n<(x') = $i(x'), n.(x") = {<(*") and 


•/(*) 



2«(*. {') <*x - 



2w(i7, * 7 ') dx . 


On the interval x'x" the arc {,• is an extremal arc whose end points are joined 
by the arc »?<. Hence we have /(() < /( 17 ), by the corollary to Lemma 4.2, 
since the strengthened condition of Legendre holds and the interval x'x" con¬ 
tains no pairs <rf conjugate points. Moreover by the usual integration by parts 
it is seen that 


J(v) 


/" 


M«t» 77') dx 



8 (*» v) dx 



0. 


It follows that J(() < 0, and the inequality (5.1) is established. 

In order to show that the matrix (5.2) has rank n at x = xj we note first 
that by the usual integration by parts we have 


(5.3) 


«/(*) 



W, 0 dx + 



«(*, 0 dx 


= fc(xi - 0 )«|. (x, - 0 ) - £t(zi + 0 )«|j (x, -f 0 ) 
= (i(xi + 0 ) W |j (x, - 0 ) - $<(x, - 0 ) W| j (x, + 0 ) 


= (f« “i* - *7.,va) a, a* 


(*,i - 1. ,r)> 
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where v* denote the values of along Vik and u lft respectively. It is clear 
that the matrix (5.2) has rank n at x = if and only if the determinant 

I 0‘ = 1, ••• ,r;l = r + 1, ... , n ) 

is different from zero. If this determinant were zero at x = x 3 , there would 
exist constants a„ 6, not all zero such that at x = x 3 

^ = rj„ b,. 

Since y/ M = 0(>= 1, • •■ , r) and the system is a conjugate system, it 
would follow that at x = x 3 ' 


0 — (f uVu — {„r),,bi 

m t*i u * a * = - v,,Vi>)a K . 

This is possible only in case a. = 0 in view of the relation (5.3) and J({) < 0 
for (a) * (0). But m th.s case we should have fc, = 0 also, since x, is a conju- 
gate point of order r. It follows that the matrix (5.2) has rank n at * = „ 
It is clear that the corners of f„ can be rounded off so as to obtain functions 
f, h8V,ng thc properties i" st described. This completes the proof 

5 2 , f be s “ m °f [ he ordcrs of conjugate points of x, between 
u If h , l Slreng " ,enfd condlllon of Legendre holds, there exists a set of 
s admissible variations (« = 1 ,■■■,,) of class C" which have 


(5.4) 


St )a.0, < 0 


(o, = 1, ... *) 


for every set of constants (a) y (0) and which are such that for every set of ad- 
mmible variations „(*) there is a set of constants us such that the matrices 


(5.5) 


Vik Vi + ntSit 


Vik 


“IT ^ ^ POl '“ ° n ***> ^ ’■* “ 0 conjugate system having 

For let conjugate points of x. between x, and x, be given by the sequence 


h < h < 


<L 


r; e ittir al,ty (51) ft is ciear 


Sa) = 0 
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whenever £,«, belong to different conjugate points U, t,. We may suppose 
j > h. On the interval x,x' the arc £will be an extremal arc. Moreover 
is identically zero on the interval ZyZ*. It follows that 

r*\ l*' 

So) = / U)dx = & « 0, 

as was to be proved. 

In order to prove the last part of Lemma 5.2, it should be noted that at 
x ~ lh *be functions £, r (r = 1, • • • , $*_ t ) are all zero. It follows that the 
matrices (5.5) involve only the last s - constants at x = t h . We may 
accordingly so choose the last s - s,_, constants that the matrices ( 5 . 5 ) 
have the same rank at x = t q . The constants ^ (y = + 1 , ... , $,_,) are 

then chosen so that the matrices (5.5) have the same rank at x = Pro¬ 

ceeding in this manner, we finally obtain a set of constants m such that these 
matrices have the same rank at each point on x,x t . This completes the proof 
of Lemma 5.2. 

We can now establish the following important result: 

Theorem 5.1. Let the strengthened condition of Legendre hold and let s be the 
sum of the orders of the conjugate points of z, between x, and x*. Then there exists 
a set of s natural isoperimctric conditions 

v) - 0 (0 =* 1 , ... , s) 

such that J(ij) £ 0 for every admissible arc V i satisfying these conditions, the 
equality holding only in case m is an accessory extremal. Moreover these condi¬ 
tions are such that the relation (5.4) holds for every set (a) (0). 

I*or let Sts (0 = 1 , ... , s) be a set of s functions having the properties 
described in Lemma 5.2, and consider the set of natural isoperimetric con¬ 
ditions 

< 5 - 6 ) S ( fr . t )-0 ( 0=1 ,•••,$). 

Let T)i(x) be an admissible variation satisfying these conditions and let us be 
a set of constants such that the matrices (5.5) have the same rank at each point 
on xix*. From Lemma 4.2 it follows that 

Jiv 4- Ptf£*) ^ 0 , 

the equality holding only in case the arc ij, + p£ i6 is an extremal arc. By 
the use of the inequality (5.4) and the relation 

A*? + vsh) = J(v) + 2 vsJite, v) + J(nste) 

— J(v) + «/({«, ^ 0 , 

it is found that J(tj) ^ 0 , the equality holding only in case m is an extremal 
arc. The set (5.6) is accordingly a minimal set by virtue of the corollary to 
Theorem 3.1. This proves Theorem 5.1. 


126 



CONDITION'S IN CALCULUS OF VARIATIONS 


213 


Corollary 1. If the strengthened condition of Legendre holds, the type number 
of J is equal to the suin of the conjugate points of x x between x x and x,. 

The following corollary will be useful in §9 below: 

Corollary 2. Suppose the strengthened condition of Legendre holds and there 
is no conjugate point of Xi on the interval x, £ x < x,. If ij,(x) is a broken ex¬ 
tremal whose comer is at x = x 3 and which has m(x x ) = 17 ,( 3 :,) = 0 , then J(rj) ^ 0 , 
the equality holding only in case 17 , is an accessory extremal. 

This result follows because the arc 17 , satisfies the conditions (5.6), as one 
readily verifies by the usual integration by parts. 

6 . Oscillation, separation, and comparison theorems. The results described 
in §§3 and 5 enable us to state the following new oscillation theorem. It is 
assumed throughout this section that the strengthened condition of Legendre 
holds on x,x,. Moreover, a conjugate point is counted a number of times 
equal to its order. 

Theorem 6.1. There are (at least ) m conjugate points of j, between x, and x, 
if and only if there exists a set of m admissible variations 17 . 3 such that the inequality 

(61) J(im, triads <0 (a, 0 = 1 , .. • , m) 

holds for every set of constants (a) ^ (0). If the variations v<eform a maximal 
set of admissible variations satisfying the condition (6.1), then there arc exactly m 
conjugate points of x x between Xi and x,. 

This result follows at once from Theorem 3.4 and Corollary 1 of Theorem 5.1. 
Corollary. The number of conjugate points of x x on x,x, is equal to the 
number of conjugate points of x, on x,x,. 

The following generalization of the Sturm separation theorem, first proved 
by Morse 9 and later by Hu , 10 is immediate. 

Theorem 6.2. If x 0 > Xi, then the k' h conjugate point of x x precedes the k ,[> 
conjugate of Xo. 

This result follows at once from the above corollary since if x, is the k ,h con¬ 
jugate point of xi, there must be fewer than k conjugate points of x, (and hence 
of Xo) on the interval XoX,. 

In a similar manner one can prove the stronger theorem: 

Theorem 6.3. The k'" conjugate point of x, varies continuously with x, and 
advances or regresses with it. 

Theorem 6.1 also gives us at once the usual comparison theorems. In the 
following theorem the integral J*(i 7 ) is assumed to be of the same type as ./( 17 ). 

Theorem 6.4. Let m, m* be the sums of the orders of the conjugate points of x, 
between x x and x, and let d, d • be the orders of the point x, as a conjugate point 

9 Morse, A generalization of the Sturm separation and comparison theorems in n-spare 
Mathematische Annalcn 103 (1930), pp. 52-69. ’ 

., T x he problem of Bolza and its accessory boundary value problem, Contributions to 

the Calculus of Variations 1931-32. The University of Chicago Press, pp. 361-444. 
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of X\ relative to the functionals J, J 0 respectively. If J(ri) ^ J*(v) for every set 
of admissible variations ( 17 ) ^ ( 0 ), then 

(6.2) m* £ m, m* + rf* £ m + rf • 

Moreover if J(r)) > J*(-n) for all such variations, then 

(6.3) m* £ m + d . 

To prove this, let 

•/(*., *t) = 0 (a = 1, ... , m) 

be a minimal set for J(rj) and let ( ir (y = m + 1, • • • , m + d) be a maximal 
set of linearly independent accessory extremals vanishing at x x and x t . We have 
accordingly 

lo)o* a 3 g 0 (a, 0 = 1, • • • , m + d), 

the equality holding only in case the first m a's arc zero. It follows that 

(6.4) J 0 (U Maji, gO (a, 0 - 1, •.. , m + d), 

where the equality sign can be excluded if the first m a s are not all zero. Hence 
m* ^ m. Moreover if J( 17 ) > •/*(»?) for every set of admissible variations 
(v) & (0), then the condition (6.4) holds for every set (a) * (0) with the equality 
sign excluded. Hence in this case m* £ m + d. In order to prove the in¬ 
equality m* + d 0 £ m + d, we may suppose that 

./•(U U) - 0 (« * 0). 

By the process used in the proof of Theorem 3.2 we can replace a variation 
which is not an extremal for J 0 and which has = 0 by one for which 

J 0 (£o) < 0. The number of extremals in the set cannot exceed d*. If 

these are deleted, the remaining functions will form a subset of a set of func¬ 
tions defining a minimal set for J 0 . Hence m* + d 0 ^ m + d, as was to be 
proved. 


II 

The accessory problem in the variable end point case 
In §§7-16 we shall study the functional 

J(v) = 27(171, nt) + ^ 2 w(x, 17, dx 

subject to a set of end conditions 

**07 ) = a 0i in(x ,) + b fi rji(xt) = 0 (m =1 , ••• , p g 2n), 
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where 2 q is a symmetric quadratic form in va and iy,*, and 2o> is a quadratic 
form in t u, Vi of the type described above. This problem arises in the study 
of the second variation of the general variable end point problems in parametric 
and non-parametric form. 

In these sections we define natural isoperimetric conditions for the functional 
J(ij) and show how they can be used to characterize the extremals of J(v) 
satisfying the conditions = 0 and the associated transversality conditions. 
This characterization leads to a new approach to oscillation and comparison 
theorems. 

Besides the assumptions made in §1 we shall assume that the matrix 

11 i ^ 11 

has rank p. An arc Vi of class D' satisfying the conditions = 0 will be 
called an admissible arc in the variable end point case. 


7. Orders of degeneracy and concavity. The Euler equations and the 
transversality conditions of J(v) subject to the conditions = 0 are the 
equations 

(7.1) LM = - (d/dx)w,j = 0, 

T <i(n» l ) = - w,;(xi) = 0, 

(7.2) T a (n, D = + »,;(x,) = 0 . 


The equations (7.2) will be termed accessory transversality conditions and the 
solutions vi of equations (7.1) of class C" will be called accessory extremals. 
One readily verifies that J(v) = 0 along an accessory extremal which satisfies 
the conditions (7.2) and = 0. The maximum number d of linearly inde¬ 
pendent accessory extremals satisfying the conditions (7.2) and = 0 will be 
called the order of degeneracy of J(v) relative to the conditions = 0. 

Let h be the maximum number of linearly independent accessory extremals 
vM = 1, • • • , r) satisfying the conditions ¥„ = 0 and having J(v*<it>) < 0 
for every set of constants (a) ^ (0). Let k be the maximum number of linearly 
independent accessory extremals vnix = h + 1, ... , h + A-) which have 
Vty(x\) = v*y(x j ) = 0 and which are such that no proper linear combination 
of these extremals satisfies the conditions (7.2). The number r = h + k will 
be called the order of concavity of J( v ) relative to the conditions = 0. It is 
clear that r-f-d ^ 2n — p + rf and hence that r ^ 2n — p, where p is the 
number of conditions = 0. 


For the periodic case, the number h has been called the order of concavity 
of J{v) by Morse. \\ c prefer the definition here given since it measures the 
difference between the type numbers of J(v) in the fixed and variable end point 
cases, as will be seen in §10 below-. 
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Let 

Q(S » n) = ( 

( 7 - 3 ) &(&>?) = £.<•»•, + $!«,;» 

-/({, *) = Q($, »>) + f at Qte,i)dx. 

J* i 

With these notations in mind we can prove the following result: 

Lemma 7.1. Suppose that the point j» u not conjugate to X\ and that the deter¬ 
minant | R,k | is different from zero on X\X%. If (0 = 1, • • • , r) forms a 
maximal set of accessory extremals satisfying the conditions = 0 and having 

•/({•, <0 (a, 0 m 1 , ... , r) 

for every set of constants (a) ^ ( 0 ), then every accessory extremal rjt satisfying the 
conditions 

- 0 , •/(*., 17 ) = 0 

and having J(rj) = 0 a/so satisfies the accessory transversality conditions (7.2). 

To prove this, we first note that if »?, has the properties described in the 
theorem, then the determinant 

J(v, 1') 

(7-4) J(S m , i;,) (y = 1, .. • , r + p + 1) 

(*) 

must be zero for every set of r + p + 1 accessory extremals i?,>. Otherwise 
for some set 17 * the equations 

I*) b, = 1 , •/(!«, i,)b, = 0 , **(v,)b, = 0 

would have a solution ( 6 ) ^ (0)." Hence by setting *j, = 17 ,> 6 , we should have 

- * 1 ) - •/(„) - 2 cJ(i 7 , A) + c*/(fl) = -2c + c'J(ti), 

«/({., V - Cfi) = /((«, 17 ) - <%/($., f») = 0. 

For c sufficiently small and positive we should have J(i 7 — cfj) < 0, contrary 
to our choice of the functions £,*. It follows that the determinant (7.4) is 
always zero and hence that there exists a set of constants h, h ai /„ not all zero 
such that the equation 

k/(i 7 , u) + *./(*., u) + /AM = 0 

holds for every accessory extremal Substituting the functions for u, 
in this last expression gives h a = 0 (a = 1, • • • , r). If now we replace it,- 

" See Bliss, The problem of Lagrange in the Calculus of Variations, American Journal of 
Mathematics, vol. 52 (1930), p. GS2. 
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by a set of p accessory extremals, no proper linear combination of which satis¬ 
fies the equations 'J'» = 0, it is seen that lo ^ 0, since otherwise the constants 
Jo, K would be all zero. This is not the case. We may accordingly suppose 
Jo = 1- Moreover, every pair of points u.(x,), u,(x 2 ) can be joined by an ex¬ 
tremal u it since the point x* is not conjugate to x x . It follows that the ex¬ 
pression 

J( v , u) + /„*„(“) = Tab, l)Ui(xi) + Tain, lMx t ) - 0 

must be an identity in the values li.(xi), u,(x*) and hence that satisfies the 
conditions (7.2) with the constants J„, as was to be proved. 

From this lemma we obtain at once the following method of determining the 
orders of concavity and degeneracy of J in case the point x* is not conju¬ 
gate tO X|. 

Thkorem 7.1. Suppose that the point xj is not conjugate to X \ and that 
| Rik 1^0 on X|X 2 . Let «?..(<*= 1, • • • , 2n - p) be a set of 2n - p linearly 
independent accessory extremals satisfying the conditions 'J'„ = 0. The order of 
concavity and the order of degeneracy of J are equal respectively to the negative 
type number and the nullity of the quadratic form 

H(z) * J( > 7 ., rjj)z a zg (a, 0 = l, • • • , 2n - p) 

The quadratic form H(z) is the classical one used in the study of variable 
end point problems. It was first used by A. Mayer 1 * and has been used in 
various forms by several writers. Its importance has been emphasized recently 
by Bliss.” 


8. Natural isoperimetric conditions. A condition of the type 

«/(*, v) = Qtt, v) + if) dx = 0 , 

J*\ 


where (x) is a function of class C" satisfying the conditions = 0 will be 
called a natural isoperimetric condition. It is satisfied by every accessory ex¬ 
tremal satisfying the conditions = 0 and the accessory transvcrsality conditions 

(7.2) , as one readily verifies by substitution and the usual integration by parts. 
Consider now a set of m natural isoperimetric conditions 

(8-1) */(£., v ) = 0 (a = 1, • • • , m)'. 

The Euler equations and the transvcrsality conditions of J relative to the con¬ 
ditions (8.1) and the end conditions *„ = 0 are the equations 

@- 2 ) L t (rj + = o, = 0, 

(8.3) Tn(ij + hj$ 3 , J) = o, 7\ 5 (i7 + prf,, J) = o, 


11 A. Mayer, Zur AufsUllung dcr Krilcrici, dcs Maxim urns and Minimum* dcr cinfachcn 
Integrate be, rariablcn Gnnurcrt-n, I.cipzi K er Borichtc, vol. 30 (1SS4), pp. 99-128." 

” Bliss, The problem of Bohn in th,- Cateulm of Vanafinns, Annuls of Mathematic* (2) 
vol. 33 (1932), pp. 261-274. 
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where L if T ilt Tn are the functions defined by equations (7.1) and (7.2). A 
solution rji, ho of equations (8.2) of class C" will be called an isoperimetric 
extremal. 

A set of natural isoperimetric conditions (8.1) will be called a proper set if 
the determinant 

I j((., U) I 

is different from zero. We have the following analogue of Theorem 2.1, which 
can be proved by the same methods. 

Theorem 8.1. If the set (8.1) is a proper set, then the multipliers ho belonging 
to an isoperimetric extremal m, no satisfying the conditions (8.1), (8.3), and = 0 
are all zero. 

The analogues of Theorem 2.2 and its corollaries can also be established by 
the arguments used in §2. The following theorem is the analogue of the second 
corollary. 

Theorem 8.2. If the determinant | R ik \ is different from zero, then an ad¬ 
missible arc rji which satisfies all natural isoperimetric conditions is necessarily 
an extremal arc satisfying the transversality conditions (7.2). 

The results described in §3 can be extended at once to the case here con¬ 
sidered. The proofs of the following theorems are identical with those given 
in §3. 

Theorem 8.3. If the set (8.1) forms a minimal set of natural isoperimetric 
conditions, there exists a proper minimal set composed of the same number of 
conditions. 

Theorem 8.4. If the set (8.1) is a proper minimal set of natural isoperimetric 
conditions, the inequality 

(8.4) J(U *,)a a a, <0 (or, 0 - 1, • • • , m) 

is true for every set of constants (a) ^ (0). Conversely, if the set (8.1) is a maximal 
set of natural isoperimetric conditions satisfying the condition (8.4), it is a proper 
minimal set. 

Theorem 8.5. The number of conditions in a minimal set is always the same. 
The number of conditions in a minimal set will be called the type number of 
the functional J(tj) relative to the conditions = 0. If there exists no mini¬ 
mal set, the type number of J will be said to be infinite. In view of Theorem 8.5 
the type number of J is a well defined concept. 

Theorem 8.6. If the functions me (fi = 1, • • • , m) form a set of m admissible 
variations having 

(8.5) /(*., £j)a«a* <0 (a, 0 = 1, •. • , m) 

for every set of constants (a) ^ (0), the type number of J is at least equal to m. 
If the set rug forms a maximal set of such variations, the type number of J is 
equal to m. 

This follows from Theorem 8.4 since it is clear that the variations me can 
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be replaced by admissible variations £< # of class C" without disturbing the 
inequality (8.5). 

We have the following interesting corollary which will be useful in the proof 
of Theorem 12.6 below. 

Corollary. If u(rj, rj') = 0, the type number of J is equal to the negative type 
number of the quadratic form 2q(i) h 17 ,) subject to the auxiliary conditions 'J'* = 0 . 

The condition of Legendre has been defined in §3. We have the following 
theorem concerning this condition. 

Theorem 8.7. If the type number m of J is finite, then the condition of Le¬ 
gendre holds on hit. Moreover, if the determinant \ R ik \ is different from zero 
on x x xt, the strengthened condition of Legendre holds on x,x,. 

This result follows at once from Theorem 3.5 since the type number of ./ 
relative to the conditions ij.fz,) = *.(*,) = 0 is clearly less than m. 


9. An equivalent problem. In many cases our arguments can be simpli¬ 
fied a great deal if we can assume without loss of generality that the point x, 
is not conjugate to x x . In this section we shall show that this can be done. 
We do so by showing that we can always replace our problem by a second prob¬ 
lem which has the same type number and order of degeneracy and for which 
the point z* is not conjugate to * 1 . In fact we can choose this new problem so 
as to have the same order of concavity and the same number of conjugate 
points of x x between z, and x t as our original problem. We assume that the 
strengthened condition of Legendre holds. 

Suppose now that we have given the functional 

JM - - 1 .) + f *’ 2 u (x, ,, ,') dx 

J*\ 

subject to the end conditions 

•) + KM**) = 0 (p - 1,..., p). 

Let zj be a point on z,z, which is not conjugate to either x x or z,. We may 
suppose that x 3 is the midpoint of z,z, since this can be brought about by a 
transformation of the form z = v>(z). Suppose that z, - 0 and let v < be an 
arbitrary admissible arc. We now set 


“<(*) = *i(x) = n.(r, + z/2), 

M = (Mi, • • • , u mt Vx t ••• , V*), 

2q*(u;,, w t ) = 2q[u{x\), r(z«)|, 

2«*(z, w;, w') = 2o,(z/2, u, 2u') + 2o,(z 3 + z/2, 1 /, 2v'), 

J*(w) = 2 g*(w lt wt) + j 7 2a>*(z, u;, w') dx, 

**A W ) = a 0i Ui(z ,) + b 0l Vi(x t ) = 0 (p = 1, ... tP ) t 

= v t {x 1 ) - u 1 (zj) =0 ( 1 = 1 ,..., n). 
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It is clear that there is a one-to-one correspondence between the arcs iu satis¬ 
fying the conditions — 0 (ji = 1, • • • , p) and the arcs w, satisfying the 
conditions ¥? = 0 (v = 1, • • • , p + n). Moreover, along corresponding arcs 
we have J(n) = J*(w). It follows from Theorem 8.6 that the type num¬ 
bers of J and J* are equal. In order to show that the orders of degeneracy 
of./ and J* are the same, we note that w, is an extremal for J • if and only if 
its image rji is a broken extremal for J having its corner at x = Xi. One readily 
verifies that the transvcrsality conditions for J*(u >) written in terms of the func¬ 
tions Ui, Vi defined above are expressible in the form 

7, u I“(*i)i K*t)l + “ U 4 X '> “(**)» 2u'(xi)) = 0, 

7,„(u(ii), P(*t)l + Khi + *(**), 2y'(x,)) = 0, 

w, ; (x,, t>(x,), 2e'(*i)l - w,j(xj, u(x t ), 2u'(x,)) = 0. 

From these equations it follows at once that an extremal u>, for the J* satisfies 
the transvcrsality conditions relative to J* if and only if its image m is an 
extremal for J satisfying the transversality conditions (7.2). Hence we have 
the following 

Lemma 9.1. There exists a functional J * and a set of end conditions = 0 
such that the point x t is not conjugate to x t relative to J*, and such that the type 
number and the order of degeneracy of J • relative to the conditions V* = 0 are the 
same as those of J relative to the conditions 'k, = 0. 

A stronger theorem is 

Lemma 9.2. The functional J * described in Lemma 9.1 can be chosen so that 
the order of concavity and the sum of the orders of the conjugate points of X\ between 
Xi and it of J • relative to the conditions = 0 are the same as those of J rela¬ 
tive to = 0. 

For by a transformation of the form x = v»(i) we may transform the interval 
xix 7 so that there are no conjugate points on the interval x t £ x < x 2 , where 
x t is the midpoint of xiij. If we construct a functional J*(w) as in the proof 
of Lemma 9.1, then it is clear that this functional has all the properties described 
in Lemma 9.2 except, possibly, that its order of concavity r* might be different 
from the order of concavity r of J. In order to show that r = r*, let 
(A = 1, • • • , q) be a maximal set of extremals for J satisfying the conditions 
= 0 and having y(i?*a*) < 0 for every set (a) * (0) and let u>>* be the 
corresponding extremals for J*. Clearly we have J*{w,a K ) < 0 for every set 
of constants (a) ^ (0). Let (7 = q + 1, • • • , r) be a maximal set of 
linearly independent accessory extremals having = »?,(x 2 ) = 0 and such 
that no proper linear combination of these extremals satisfies the accessory 
transversality conditions. Let w iy be-the corresponding extremals for J*. It 
is clear that J*(w y ) = 0 (y = q + 1, • • • , r). We may suppose further that 
the extremals w, a have been chosen so that 

(9.1) J\w a , w,) = 0 (or * 0; *, 0 = 1, , r). 
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Consider now the system 

(9.2) J*(w a , w) = 0 ( a = 1, ... , r - 1). 

The extremal w ir satisfies these equations, gives J * the value zero, but does 
not satisfy the transversality conditions. It follows from Lemma 7.1 that there 
exists an extremal satisfying the conditions (9.2) and V* = 0 and having 
J* < 0. Hence if we replace w ir by we obtain a set w ifi {fi = 1 , ... , r) 
satisfying the conditions (9.2) and having q + 1 of the numbers J*(u>g, wg) 
negative. In a similar manner we can replace each of the remaining extrem¬ 
als W/ y (y « q + 1, • • • , r — 1) for which J* = 0 by an extremal for which 
J* < 0. We obtain thereby a set of r extremals satisfying the conditions 
V* = 0 and having 

J*(w a , wg) = 0 (a * 0), J*(w t , wg) < 0 r). 

It follows that r ^ r*. 

In order to show that r* £ r, let u, t (y _ 1, ... , h x ) be a maximal set of 
broken extremals for J having their corners at x = x h having 

w, 7 (z.) = u iy (x t ) = 0 

and 

J(u„ u,) = 0 (y * 1 ), y(u„ * o (y, { - 1 , ... , r). 

According to Corollary 2 of Theorem 5.1 it is clear that J(u„ «,) > 0. Let 
(t = A, + 1, .. • , A) be a maximal set of linearly independent accessory 
extremalshaving «„(xi) = u,,(x,) = 0 and satisfying the accessory transver- 
sality conditions. Let (0 = I, ... , m) be a maximal set of broken ex¬ 
tremals having their corners at z = x,, satisfying the conditions *„ = 0 and 
having 

(9. 3) y( "” = 0 (*-!,•••, *). 

, 4 ) = 0 (a X 0), J( Vt , „) < 0 (a, (3=1,..., m). 

It is clear that m = r*. We may suppose that the first q of the arcs n.i form 
a maximal set of accessory extremals satisfying these conditions. Then the 
matrix 

(94) II “ 0) - Mr, + 0) || (a - 7 + 1, , r*). 

will have rank r* - 7 , where Ut denotes the values of along i, it . Other¬ 
wise there would exist a set of constants a a not all zero, such that 

(f.*(ra - 0 ) - + 0 ))a # = 0 (< 7 = 7 + 1 ,..., r *). 

It would follow that the arc 7 , = rj,.a t would be an extremal arc having J < 0 
and independent of the first 7 extremal ares (a = 1 . . - - , 7 ), contrary to 
our cho.ce of these extremals. The matrix (9.4) accordingly has rank r* - 7. 
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We shall now show that r* - q £ n - h, where h represents the number of 
extremals in the set chosen above. To do this we recall that u iy {x i) = 
= 0. Since the arcs jj* satisfy the conditions (9.3), it is found by the 
usual integration by parts that 

JiMy, *) = u iy (x t ) (Mxi - 0 ) - fu(x* + 0 )] = 0 

(y - 1, ... , h;<r - q + 1, ... , r). 

Moreover, the matrix || u, T (x a ) || has rank h since the point x% is not conju¬ 
gate to x x . It follows that the matrix (9.4) can have at most n — h columns 
if it is to have maximum rank. Hence we have r* — q £ n — h. But from 
the choice of the functions u, T it is clear that n — h = r — q and hence that 
r* r, as was to be proved. 

We can also establish the further interesting result: 

Lemma 9.3. There exists a functional J • and a set of end conditions ¥? a 0 
such that there are no conjugate points of X\ on x x it and such that the type number 
and orders of degeneracy of J* relative to the conditions = 0 are equal respec¬ 
tively to those of J relative to 4% = 0. 

The proof is like that of Lemma 9.1. We select points l, (v a 0,1, • • • , h) 
on ZiZj such that 

*• * z, < t x < • •. < tk-X < U « x, 

and such that there are no conjugate points of t,-x on the interval t,- x t, (v * 1 , 
• • • , h). We map the interval t,- X t t on Z|Z| by a transformation t - ^,(z) and 
set 

Mx) - *?,k(x)] O' - 1, • • • , h) 

(w) m (u„, • • • , u ml , • • • , u u , • • • , u,*), 

2q*(w x , w t ) = 2q[u x (x x ), u*(zj)], 

2w*(z, w, xo') - £ 2»(*. Ur, u7f»r), 

r-1 

J*(xd) = 2 q*(w x , Wt) + / 2w*(z, w, w’) dx, 

J*\ 

+ b,{u iK (x t ) =0 (m = 1, ... , p), 

*;,(u>) = u„(z,) - u,-,_,(zj) =0 (t = 1, • • • , n; v = 2, •.. , h). 

The proof that the functional J * so defined has the properties described in 
Lemma 9.3 is like that of Lemma 9.1. Thus we see that w’e can not only 
assume without loss of generality that point z* is not conjugate to x x but also 
that there are no conjugate points of x x on the interval z t zj. 
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10. The existence of minimal sets. We can now establish the following 
Theorem 10.1. Let r be the order of concavity of J and s the sum of the orders 
of the conjugate points of x x between x, and x,. If the strengthened condition of 
Legendre holds on x x x t , there exists a set of m = r + s natural isoperimetric con¬ 
ditions 

(10.1) j(b,v) = 0 (0-1, 

such that the inequality 

(10.2) J(t m , £*)a«a, <0 (a, 0 = 1, ... , m) 

is true for every set of constants (a) * (0) and such that J( v ) Z 0 for every set of 
admissible variations v* satisfying the conditions ( 10 . 1 ), the equality holding only 
in case iji is an accessory extremal satisfying the accessory transversality condi¬ 
tions (7.2). 

For in view of Lemma 9.2 we may assume that the point x , is not conjugate 
to x x . Let 

v) - 0 (a = 1, ... ,s) 

be a proper minimal set for the fixed end point case and let / O' = 1, • • • , r) 
be a maximal set of accessory extremals satisfying the conditions - 0 and 
having 

(10 3) J(vk, Vi)b h bi <0 (h t j - 1, ... , r) 

for every set of constants (6) * (0). Let Vi be an admissible arc satisfying 
the conditions 


(10.4) 


i?) - 0 (a 

v) = 0 (j 


and let u. be the accessory extremal joining the end points of 17 <. 
satisfies the conditions (10.4) since £<.(*,) - {..(*,) = 0 and 


1» ’ • • > 5 )» 
: 1, • • • , r) 
The arc u, 


^(f«, u) = u') 


0, 


*7 

J(vi, U) = uv^Tii, ,;.) = v'i) 


J («?/, v) = 0 , 


*1 


as o „ e readily verifies The arc * = „ - u. has ,,(*,) . ,,(*,) = 0 and 

satisfies the conditions (10.3). It follows from Theorem 5.1 that J(r,) g 0 the 

equality holding only in case fl, is identically zero since I, is not conjugate to 
x x . But 


J(fl) = J(, - ii) = J(i) - 2 J( v , u) + J{u) = J( v )- j{u) 
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since 

l*J *t 

•/(»h “) = w') = u,w,;(u, *') = j(u). 

Ixi X( 

Hence J(v) ^ -/(a), the equality holding in case jj, s u,. Moreover, if u, does 
not satisfy the accessory transversality conditions, we have J(u) > 0, since 
otherwise the set na could not be a maximal set of extremals satisfying the 
conditions (10.3) and 4'„ = 0. It follows that /(i?) > 0 unless »j, is an accessory 
extremal satisfying the accessory transversality conditions. The set of natural 
isoperimctric conditions (10.4) accordingly has the properties described in the 
theorem. 

Corollary 1. If the strengthened condition of Legendre holds, the type number 
of J is equal to the order of concavity of J plus the sum of the orders of the conju¬ 
gate points of T\ between X\ and ij. 14 
With the help of Theorem 8.6 one obtains the further result: 

Corollary 2. Suppose the strengthened condition of Legendre holds and 

(10.5) S(t««)-0 («-l. •••,«) 

is a proper minimal set for the fixed end point case. Let (h = 1, • • • , r) be 
a set of admissible variations satisfying the conditions (10.5) and having 

*l i)a&, <0 (h,j - 1, ... ,r) 

for every set of constants (a) * (0). The number of variations in a maximal set 
of such variations is always the same and is equal to the order of concavity of J. 
The following corollary will be useful in §13: 

Corollary 3. If the strengthened condition of Legendre holds and there are no 
points conjugate to x x on the interval hit, the type number and order of degeneracy 
of J are equal respectively to the negative type number and the nullity of the quad¬ 
ratic form // (z) described in Theorem 7.1. 

In case there are conjugate points of x x on the interval X\X-t , the type number 
and order of degeneracy of J can also be determined by means of a quadratic 
form. To prove this choose points /, (v = 0, 1, • • • , h) on X\Xi as in the proof 
of Lemma 9.3, and let v,s (0 = 1, •••,?) be a set of q = (h + 1 )n - p linearly 
independent broken extremals satisfying the conditions 4', = 0 and having their 
corners on the hyperplanes x ■ t, (v ■ 1, • • • , h — 1). Let 

(10.6) UM = J(, mt (or, 0 - 1, • • • , q). 

Interpreted in terms of the functional J * described in the proof of Lemma 9.3 
it is clear that the quadratic form ll\{z) is one of the type described in Corollary 3 
for the new functional ./*. Hence we have the further result: 

14 About a year a*o Professor Morse stated to Dr. Hestenes that in an identically nor¬ 
mal problem of Bolza the negative type number of his index form is equal to the sum of 
the orders of the conjugate points of Xi on X|X, plus the negative type number of the asso¬ 
ciated Mayer form. This is essentially the equivalent of the result described in this 
corollary. As far as we know, Professor Morse considers only the non-degenerate case. 
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Corollary 4. If the strengthened condition of Legendre holds, the type num¬ 
ber and the order of degeneracy of J are equal respectively to the negative type 
number and the nullity of the quadratic form ( 10 . 6 ). 

The quadratic form //,(z) has been used by Morse to characterize the func¬ 
tional J. He defines the negative type number of //,(z) to be the index of J. 
Thus we see that the index of J as defined by Morse is identical with the notion 
of type number of J here defined. 

The order of concavity of J can also be determined by a quadratic form even 
if the point x, is conjugate to For let x, be a value of x such that there 
are no conjugate points of x, on x, £ x < x,. Let 17 (a = 1, • • • , 3 n - p) 
be a set of 3n - p linearly independent broken extremals having their corners 
at x = x 3 and satisfying the conditions = 0 and put 

( 10 - 7 ) (« 9 0 _ 1, ... , 3ft - 7 >). 

The following theorem follows at once from the proof of Lemma 9.2. 

Theorem 10.2. If the strengthened condition of Legendre holds, the order of 
concavity and order of degeneracy of J are equal respectively to the negative type 
number and nullity of the quadratic form ( 10 . 7 ). 


11. The order of concavity of a problem with respect to a sub-problem. 
Consider now a problem P determined by a functional J and a set of end con¬ 
ditions = 0 (n = 1, ... , p) Let P* be the problem obtained by replacing 
the end conditions - 0 by - 0 (, - 1, ... . p*). If the functions are 
linearly dependent on the functions the problem P- will be called a sub- 
problem of the problem P. 

Let h be the number of accessory extremals (« - 1, ... , h) in a maximal 
set satisfying the conditions = 0 but not the conditions = 0, and having 


J(v • * va)a m a a < 0 


(«,0 » 1 , ... , A ) 


for every set of constants (a) * (0). Let k be the number of linearly inde¬ 
pendent extremals > 11 , (y — A + 1 , . ■ • , h + k) in a maximal set, every proper 
linear combination of which satisfies the end conditions = o the trans- 
versahty conditions for />• and the conditions „) = 0 (a = 1 , ... h) but 
not the transversality conditions for the problem P The number q L h ’+ k 
will be called the order of concamly of Ihe problem P with respect to the sub¬ 
problem P One readily verifies that q £ p- - p. ft is clear that the order 
of concavity r of P as defined in §7 is the order of concavity of P with respect 
to the sub-problem />* in which the end conditions are of the form 
= vifa) = 0. 

We can now establish the following result: 

Jr??! 11 ' U ‘ ?' m ‘ be ‘ he lm numbers and r * be Ihe orders of con¬ 
tra °, P ‘ \ res f eclud 'J- Lel 1 be Ihe order of concacity of P with respect to P- 
the strengthened condition of Legendre holds then 


( 11 - 1 ) 


* = m — m* ^ p* 


~ P- 


q = r — > 
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This result is immediate. For, let Un (j = 1, ••• , k x ) be a maximal set 
of linearly independent accessory extremals having U u(x\) = V {i ( i s ) = 0, 
every proper linear combination of which satis6es the transversality conditions 
for sub-problem P* but not those for P. If h, k are the numbers used in the 
definition of the order of concavity q of P with respect to P* and r, r* are the 
orders of concavity of P, P* as defined in §7, then clearly 

(11.2) kt £ k, h + h = r - r* 

by virtue of the definitions of r, r*. It follows that m* + q £ m, since by 
corollary to Theorem 10.1 we have m = r + s, m* = r* + s t where 8 denotes 
the sum of the orders of the conjugate points of x x between ii and x%. More¬ 
over, m Z m* + q. For, let * t (x) (y « 1, • • • , q) be a set of independent 
accessory extremals of the type used in the definition of q and consider the 
system 

J(t., 1?) = o («-!,••• »**•)» 

^ ^ n) = o (y - l, •••, q), 

the first m* of which form a proper minimal set for P*. We may suppose that 
the functions have been chosen so that 

J(i., Vy) = 0, 

as one readily verifies. The last k functions of the set give the functional J the 
value zero since they satisfy the end conditions and the transversality condi¬ 
tions for P*. But since they do not satisfy the transversality conditions for P, 
each of these k functions can be replaced by an admissible arc for which J < 0 
and which satisfies the remaining conditions (11.3), as can be seen by an argu¬ 
ment like that given in the proof of Theorem 3.2. It follows that m = m* + q, 
as was to be proved. 

The inequality 0 ^ m - m* £ p* - p has been given by Morse (p. 92). 
Assuming that the strengthened condition of Legendre holds we have 
Corollary. Suppose the conditions 

(11.4) /(Ul)«0 (a = 1, ... ,m*) 

form a proper minimal set for the sub-problem P* of P and let Tj iy (y = 1, • • • , q) 
be a set of q admissible variations for the problem P satisfying the conditions (11.4) 
and having 

J(rht vdQyQi <0 (y, 6 = 1, • • • , g) 

for every set of constants (a) ^ (0). The number of variations of this type in a 
maximal set is always the same and is equal to the order of concavity of P with 
respect to P*. 

12. Oscillation and comparison theorems. We assume that the strengthened 
condition of Legendre holds on x t x t . The following general oscillation theorem 
follows at once from Corollary 1 of Theorem 10.1. 


140 



CONDITIONS IN CALCULUS OF VARIATIONS 


227 


Theorem 12.1. Let m be the type number and r the order of concavity of J(i 7 ). 
There exist exactly m - r conjugate points of x x between x x and x 2 . 

We have: 

Corollary. The number s of conjugate points of x x between x x and x 2 satisfies 
the inequality 

( 12 . 1 ) m- 2 n + p^s^ m . 

This follows because r S 2n - p. The relation (12.1) is the best oscillation 
theorem of this type given heretofore (Morse, p. 95 ). 

The problem determined by the functional J and the end conditions = 0 
= 1, • • • , p) will be referred to as the problem P. Let P’ be a problem 
of the same type determined by a functional J • and a set of end conditions 
*» = 0 (y = 1, ■ ■ ■ , p*). The following result is an immediate consequence of 
the above theorem. 

Theorem 12.2. Let m, m • be the type numbers, r, r* the orders of concavity, 
and s, s* the sums of the orders of the conjugate points of x x between x x and x 2 of 
J and J* respectively. Between these numbers we have the relations 

m - m* = r - r* + s - s* , 

I m - !»* I £ I * - | + max ( 2 n - p,2n - p*]. 

The last relation follows from the fact that r £ 2n - p, r* £ 2n - p* 
Corollary 1. If «(,, ,') - «•(,, then 

m — m* = r — r * , 

I m - m* | g max [ 2 n - p,2n - p •) £ 2 n. 

Theorem 12.2 is a special case of the following more general theorem. In 
this theorem we suppose that P x , P\ are sub-problems of P, P • respectively 
and denote the type numbers of the problems P, P\ P x , P\ by m , m*, m„ mf 
and the number of end conditions for these problems by p, p\ p x , p\ respec¬ 
tively. 

Theorem 12.3 .. If q, q • are the orders of concavity of P, P • with respect to the 
sub-problems P x , P\, then 

m - m* - q - q* + m , _ m \ t 

I m - m* | £ | m, - m* | + max - p, p* _ p*]. 

This result follows at once from Theorem 11 . 1 . 

^Cobollaby 1. If the problems P, P> have a common sub-problem P, - P' u 

m — m* = q — q* f 

| m - m* | ^ max [p, - p, p, - p*J ^ 2n. 
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Corollary 2. If Pi, P J are sub-problems of the problem P a P*, then 
mi — mj = q* - q, 

| mi - m* | £ max (p, - p, p\ - p] ^ 2 n. 

Consider two problems P, P • having the same end conditions ^ = 0 
(/x = 1 , • • • , p) and let 

/'(*) = j( v ) - ✓•(,). 

The functional J'(v) and the end conditions 4% = 0 determine a problem which 
we shall denote by P' = P - P % . 

Theorem 12.4. If m' is the type number of the difference problem P‘ = P - P*, 
u»e Anve the relation 

m ^ m* -f m\ 

The theorem clearly holds if m' is infinite. Suppose now that m' is finite 
and that the set 

( 12 . 2 ) /'(!#, n )-0 0 - 1 , •••,*') 

is a proper minimal set of natural isoperimetric conditions for the functional 
J' - / - J*. It follows that 7(ij) £ •/•(»!) for every admissible arc in satis¬ 
fying these conditions. Let (7 = 1, • • • ,k) be a maximal set of admissible 
variations satisfying the conditions ( 12 . 2 ) and having 

J(vy, 171)0*0, <0 (y, a - 1, • • • , h). 

Clearly m £ h + m'. Moreover, this relation holds if J is replaced by J *, 
since the functions 17 ,* satisfy the conditions (12.2). Hence m - m' £ h £ m*, 
and the theorem is established. 

In case m' = 0 we have the stronger 

Theorem 12.5. Let m, m* be the type numbers and d, d * the orders of de¬ 
generacy of P, P*, respectively. If J(v) ^ J*(v) for every admissible arc ( 17 ) jd (0), 
then 

m* ^ m, m* -f d* ^ m + d. 

Moreover, if J ( 17 ) > 7*( 77 ) for all such variations, then 

m* m -f d. 

The proof of this theorem is like that of Theorem 6.4 in the fix$d end point 
case. 

Consider now two problems P, P * having the same end conditions and having 
«(i 7 , V) = u *(ii, 17'). Let 

171) = 9(17., vt) - Q*(m, vt)- 
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Let h, k denote respectively the negative type numbers of D( m , - D( V i, v*) 
subject to the conditions 

= + i.iljil = 0. 

We have the following theorem: 

Theorem 12.6. If the problem* P, P- are related as described above, then 

m* — k £ m £ m* + h. 

By virtue of the corollary to Theorem 8.6 it is clear that in this case the type 
numbers of the difference problems P - P> and P* _ P are equal respectively 
to A and k. Hence by Theorem 12.4 we have m S m* + A, m* £ m + k, as 
was to be proved. 

This result has been established by Morse by a different method (Theorem 
4.3, p. 93). The general comparison theorem given by Morse on page 94 
follows at once from Theorems 12.3 and 12.6. Results of this nature have 
also been established by Hu“ by still another method. 

13. Functionals varying continuously with a parameter. Consider now the 
functional 

J(v, <0 = Mm, m, <0 + / 2u(,,e) dx, 

V* 1 

where 2? is a symmetric quadratic form in whose coefficients are contin¬ 
uous in v for all values of v, and 2« is a quadratic form in „, We shall 
assume that the functions 


Pik = Pki, Q„, R„ = P„, ^L*. 

ex ' ex ’ 

belonging to 2 oi, are continuous in z and » for x on z,z, and for all values of a 
Moreover, the coefficients of and in the end conditions 

*.(v, «) = a.iWmti,) + b„(oMx,) = 0 

are assumed to be continuous in a for all „ and the matrix || a. ( t.i II is sup¬ 
posed to have rank p. We shall assume further that the inequality 

Rn(x, o)x,n k >0 (», k = 1, ... , n) 

holds on iii. for all values of * and for every set (r) * (0). It follows from 
Corollary 1 of Theorem 10.1 that the type number m(o) of J( % <r) relative to 
the end conditions = 0 is finite for all values of o. 

We have the following 

Theorem 13.1. Let m(a) and d(o) be, respectively, the type number and the 
order of degeneracy of J( v , a). For values of a sufficiently near a 0 u>e have 

11 Loc cit., pp. 425-443. 
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m(<r) ^ m(ffo). Ifd(<r 0 ) = 0, thendfo ) = 0, m(«r) = m(<r 0 ). //d(<r) = 0 (<r ^ <r 0 ), 
then 

(13.1) m(a 0 ) £ m(<r) £ m(«y 0 ) + <*(*•)• 

In order to prove the first statement in the theorem, let <r) be a set of 
m(<r 0 ) admissible variations continuous in o, satisfying the conditions ¥,,(17, <r) = 0 
for a near <r 0 , and having 

Jill W, iM, <r)a a a f <0 {a, 0 — 1, • • • , m(a 0 )) 

for a = <ro, where ./($, r \, a) is the bilinear functional (7.3) formed for the quadratic 
functional JO7, <r). This relation clearly holds for a sufficiently near <r 0 and 
hence m(<r) ^ m(<r 0 ) by Theorem 8.6. 

In order to prove the inequalities (13.1), we first note that if there are no 
conjugate points of x' on an interval x'x' relative to J(it, <ro), the same is true 
for a sufficiently near <r 0 . This result follows at once by continuity considera¬ 
tions. It follows that the construction made in the proof of Lemma 9.3 is 
valid for all a sufficiently near <r 0 and that we may accordingly assume that 
there are no conjugate points of X\ on x x x\ relative to J (17, a) for these values 
of «r. Let i7u(z, <r) (h = 1, • • • , 2n - p) be a set of 2n - p extremals for 
J(v, <0 varying continuously with a and satisfying the conditions *,(17, o) * 0. 
Let H(z, a) be the quadratic form 

H(z, <r) = /(17*, 17/, o)z#i (h, j - 1, • • • , 2n — p). 

This quadratic form is continuous in <r. Moreover according to Corollary 3 
of Theorem 10.1 the negative type number and nullity of H{z, o) are equal 
respectively to type number m(o) and order of degeneracy d(o) of J(i 7, c) t since 
there are no conjugate points of x x on x x x t for these values of <x. It follows 
from the theory of quadratic forms that if d(a 0 ) = 0, then d(<r) = 0 and 
m{a) = m(«r 0 ) for values of o sufficiently near * 0 . Moreover, the same theory 
tells us that if d(o) = 0 {a 00 ), as a increases (or decreases) starting from ao, 
the negative type number m(a) of H{z, a) increases by at most the nullity 
d(<x 0) of H(z, <r 0 ). This proves the relations (13.1), and Theorem 13.1 is es¬ 
tablished. The arguments used in this paragraph are of the type used by 
Morse in a similar situation. 

A value of a for which /(17, a) is degenerate will be called a point of degeneracy. 
Such a point will be counted a number of times equal to the order of degeneracy 
of J(i 7, <r). It is clear that points of degeneracy are identical with the charac¬ 
teristic roots of the boundary value problem 

£<(*» <r) = - (d/dx)u,‘. = 0, 

Ti 1(17, l,°) = 0, Tafa l, <r) - 0, *,(*, <0 = 

where the transversality conditions Tn = 0, Tn = 0 for the functional J are 
defined as in §7. 
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Theorem 13.2. There are at least \ m(a t ) — m(o i) | points of degeneracy 
{characteristic roots) on the interval o\ ^ a ^ a,. 

This result follows at once from the inequality (13.1) if the number of points 
of degeneracy on this interval is finite. 

Theorem 13.3. If for every integer k there exists a set of k constants <r, < 
** < ^ such that m(<r y ) ^ m{cr M ) (j = 1 , ... , k - 1), there exist 

infinitely many points of degeneracy {characteristic roots). If the function m{o) 
oscillates at each point of degeneracy, this criterion is also a necessary condition for 
the existence of infinitely many points of degeneracy. 

This result follows at once from Theorem 13.2. Thus we see that if the 
points of degeneracy are to be finite, the function m{o) can have at most a 
finite number of points of oscillation. 

Theorem 13.4. If there exists a sequence |a*| such that lim m{<r k ) = «, there 

exist infinitely many points of degeneracy {characteristic roots). This criterion 
is also necessary if m{o) is monotone and oscillates at each point of degeneracy. 

For clearly in this case m{o) must have infinitely many points of oscillation, 
since otherwise m{<r k ) could not become infinite with k. 

The following further criterion is useful: 

Theorem 13.5. If for every integer k there exist a constant <r k and a set of k 
admissible arcs i?<«(<*= 1, • • • ,k) satisfying the conditions o k ) = 0 and 
having 

(132) Vi, a k )aji, <0 (a,0 m 1, ... ,h) 

for every set of constants (a) * (0), then there exist infinitely many points of 
degeneracy (characteristic roots). This criterion is also necessary if m(<r) is mono- 
tone and oscillates at each point of degeneracy. 

For by virtue of Theorem 8.6 it is clear that m(»») £ k. Hence 

lim m{o k ) «= oo. 

The theorem now follows from Theorem 13.4. 

A more special criterion is 

Corollary 1. If there exists a point x t , a positive constant M, and a set of 
constants (in, ■ • • , ir„) such that 

lim «„(*,, < r )mm < M , 

(I3 3) lim I Q*(xs, o)r,T t \ < M, 

«0 


tom Pi k {x o , < r ) iTiT k = — co , 

there exist infinitely many points of degeneracy (characteristic roots). 

For from continuity considerations it is clear that the relations (13.3) hold 
for aU values of * on an interval *'*' containing zo in its interior. Let k be 
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an arbitrary integer and divide the interval x'x ' into k equal segments 
h a (a * 1, • •. , k). Let p a (x) be a function of class C' which is positive on the 
interval h„ and identically zero elsewhere. Set ij.. = p a (x)ir,. By virtue of 
the relations (13.3) for any x 0 on x'x' it is clear that there exists a constant <r k 
such that the condition (13.2) holds for these variations rj im . The corollary is 
thereby established. 

The following simple criterion has been given by Morse (p. 99) in a some¬ 
what different form. 

Corollary 2. Suppose that the functions /?,*, Q,* are independent of o and 
Pa = Mik — oNu. If the functions Nik are not all identically zero on X|X», 
there exist infinitely many points of degeneracy (characteristic roots). 

For clearly in this case there exists a value x 0 and a set of constants (*) 
such that 

Nik(Xo)riW k 0, 

since otherwise the functions N ik would all be identically zero on x,x 2 , the 
matrix || N ik || being symmetric since P ik = P*. for all a. For this value x 0 
and set of constants (ir) the relations (13.3) hold for either a or -a. This 
proves Corollary 2. 

The function m(<r) will be called a proper monotonically increasing function 
of o if 

(13.4) m(o) + d(o) £ m(0 (<r < o'). 

Theorem 13.6. If the function m(o) is a proper monotonically increasing 
function, the points of degeneracy (characteristic roots) are isolated. Moreover for 
values of o sufficiently near a fixed value o 0 

(13.5) m(o) = tn(o 0 ) (o < o 9 ), m(o) = m(o 0 ) + d(o 0 ) (o > o 0 ). 

There are exactly m(o,) - m(o x ) points of degeneracy on the interval o x £ e < a,. 
If there exists a value o x such that m(e x ) = 0, there are exactly m(o) points of 
degeneracy less than a. 

For by virtue of the inequality (13.4) it is clear that there can be at most a 
finite number of points of degeneracy on any interval o x £ e $ o r , otherwise 
m(o) would become infinite as a varies from o x to o 7 . This is not the case. It 
follows that in this case the points of degeneracy must be isolated. The rela¬ 
tions (13.5) follow at once from (13.1) and (13.4). The last two statements of 
the theorem are immediate consequences of the relations (13.5). Theorem 13.5 
is thereby established. 

Theorem 13.7. If m{o) is a proper monotonically increasing function, the 
criteria described in Theorems 13.3, 13.4 and 13.5 are nccessai'y and sufficient 
conditions for the existence of infinitely many points of degeneracy. 

This result is immediate. 

The existence of properly monotonically increasing functions m(o) is estab¬ 
lished by the following 
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Theorem 13.8. Suppose that the end conditions = 0 arc independent of a 
and that the inequality 

0^-6) J(v, o) ^ J(v,o') ((a < a') 

holds for every set of admissible variations (,) fi ( 0 ) satisfying the conditions 
= 0. Then the function m(<r ) is a monotonic increasing function of o. If the 
points of degeneracy are isolated, then mfo) is properly monotonic. If the condition 
(13.6) holds with the equality sign excluded, then m(„) is properly monotonic. 

This result follows at once from Theorem 12.5. Thus we see that the con¬ 
clusions described in Theorems 13.6 and 13.7 are true for a functional which 
satisfies the condition (13.6) with the equality sign excluded, at least if the end 
conditions are independent of <r. This problem has been fully treated by 
Morse (ch. IV). He makes the added assumption that m(o) = 0 for -a 
sufficiently large. This assumption is not as restrictive as it seems since if one 
is interested in the characteristic roots greater than a value a 0 , one can always 
modify the functional for values a < <r 0 so that this condition is satisfied. 

Consider now the functional J(r,) and the end conditions = 0 used in the 
preceding sections. The functional 


J(rt, a) * J( v ) - a / mmdx 

J*i 

will be called the characteristic functional of J. This functional is clearly of 
the type described in the last statement of Theorem 13.8. Moreover it is well- 
known that in this case m(o) = 0 for -o sufficiently large, provided that the 
strengthened condition of Legendre holds for /(,), as we shall assume. It 
follows that the characteristic roots of this functional are not only isolated but 
infinite in number by virtue of Corollary 2 of Theorem 13.5. We have the 
following result first proved in the general case by Morse:'* 

Theorem 13.9. The type number m of J( v ) is equal to the number of negative 
characteristic roots of its characteristic functional J( v , a) and its order of degeneracy 
ts equal to the order of a = 0 as a characteristic root. 

The Euler equation, the transversality conditions, and the end conditions 
associated with the characteristic functional J( v , „) are the equations 


(13.7) 

T •i(*7» 0 = 0, 


Li(n) - <nu = 0 , 
Tn(v, l ) = 0 , 


= 0, 


where !»*(*), T ih T it are the functions defined by equations (7.1) and (7.2). 

is system is commonly called the accessory boundary value problem The 
functions ^ associated with a solution ( f „ a, /,) of this system are called char- 
aclenshc solutions corresponding to the characteristic root a provided these 


° aS Ja ™* Sche dfr VorialionsrcchuuNg und die Ottilia- 

M.tfcrmrtfarh, Annulen. 
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functions are not all identically zero. It follows that if <r 0 is a characteristic 
root, then the number d(<r 0 ) of linearly independent characteristic solutions 
corresponding to <r 0 is equal to the order of degeneracy of 7 (jj, a 0 ). We shall 
call d(< 7 0 ) the order of the characteristic root a 0 . 

Let ./(£, 17 ) be the bilinear functional belonging to /( 17 ) and let £,(z) be a 
characteristic solution corresponding to a characteristic root. If the arc 17 ,• 
satisfies the conditions = 0 , then it can be shown that 

( 13 - 8 ) J(S, l) = <r [ tiVidx. 

J* 1 

This result follows at once by the use of equations (13.7). For, let l„ be the 
constants with which {<(*), a satisfy these equations. We then have 

v) - v) + f v) dx 
J* 1 

" *) + + I’7<"tj)i + [ 9<I«(|) dx 

™ t) + * / Vitidx 

= tr {<*7< dx, 

J‘x 


as was to be proved. Thus we see that if <r ** 0 , the orthogonality condition 
(13.9) j dx = 0 

is equivalent to the natural isoperimetric condition ./($, 17 ) = 0. The condi¬ 
tion (13.9) has been widely used in the study of boundary value problems. 

Let 


*1 £ ** ^ ••• ^ *- 

be the negative characteristic roots, each repeated a number of times equal to 
its multiplicity, and let 

$a(z), $<t(x), • • • , £,.(x) 


be a set of corresponding characteristic solutions. It is understood, of course, 
that the solutions which correspond to the same characteristic root are linearly 
independent. It is well known that /( 17 ) ^ 0 in the class of admissible arcs 
ru(x) which satisfy the conditions = 0 and 



SiaVi dx = 0 




,m). 
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We shall give here a new proof of this fact by showing that the conditions 

v) = 0 (a = 1, • • • , m) 

form a proper minimal set of natural isoperimetric conditions for J(v). In 
view of equations (13.8) and Theorem 13.9 it is sufficient to show that 

(13.10) J(i., Ma^a, <0 (a, 0 = 1, . • • , m) 

for every set of constants (a) (0). To prove this we first note that 

0 = «/(!•» U) — £•) = (*• - <rt>) I tutu dx 

J*i 

and hence that /({«, {*) = 0 whenever o* * a f , by virtue of equations (13.8) 
again. Moreover, as is easily seen, we may suppose that solutions {,« have 
been chosen so that /($., {*) = 0 even in case <r« = o d (a y* 0). The in¬ 
equality (13.10) now follows at once since a m < 0 and 

/*» 

Z*m dx (a not summed), 

i 

by virtue of equations (13.8). w 


14. The case of one variable end point. Suppose now that the functional 
J is of the form 

< 141 ) JU) =■ 2j(„) + 2»(i, ,, ,') dx, 

where 2 q is a symmetric quadratic form in the variables and that the end 
conditions are of the form 


(14.2) - a,«9<(*i) - 0, Vi ( Xt ) = 0. 

The transversality condition then takes the form 

Ta(ri , /) =3 q — « f '(Xi) =s 0. 

A value x, = x, will be called a focal point of the point x, if there exists an 
accessory extremal rn(x) satisfying the conditions 


= 0 , 


T*(v, l) = 0 , 


V%(x l) 


vith a set of constants l, and having („) jk (0) on z.x,. By the order of a focal 
point Xj will be meant the maximum number of linearly independent accessory 
extremals satisfying these conditions. 


Trllf e . HiCk6 »' A " Ration of,he Calculu, of Variation, ,o boundary value problem. 
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The following thoorcin is well-known. We assume that determinant \ Rik\ 
is different from zero. 

Lemma 14.1. Let va (*' = 1, ••• , n) be a set of n linearly independent ac¬ 
cessory extremals satisfying the conditions 

(14.4) 4/. = 0, TM, 0 = 0 

with a set of constants The system t?,* forms a conjugate system. Every ac¬ 
cessory extremal rj, satisfying the equations (6.1) at x = X\, with a set of constants 
l„, is expressible linearly with constant coefficients in the form 

I, = vu(ht K = K* a i- 

Moreover a value x, is a focal point of x t if and only if | g,,(x j) i = 0. 

We have the further well-known 

Lemma 14.2. • Every conjugate system tj satisfies the conditions (14.4) for a 
suitably chosen functional ./ and end conditions of the form (14.1) and (14.2). 

To prove this, let fa he the values of u,; along the extremals Let n — p 
he the rank of the matrix *«(j|) ||, and suppose that the functions rj., have 
been chosen so that v,„(xi) = 0 (m = 1, • • • , p). Set a p , = f,„(.ri). The con¬ 
ditions 'P M = a 0i rj t (x ,) = 0 will he chosen as our end conditions. In order to 
determine the quadratic form, we make second choice of our conjugate system 
t)u so that fa » fi, at i ■ x%. We then set 2 q = fa(xi)i},i vti. One readily 
verifies that the problem so obtained has the properties described in the theorem. 

If one uses the conjugate system rj ik described in Lemma 14.1 and makes the 
obvious changes such as replacing conjugate points by focal points, etc., one 
obtains the following theorem by precisely the same arguments as those used 
in the proof of Lemmas 5.1, 5.2 and Theorem 5.1. 

Theorem 14.1. Let t be the sum of the orders of the focal points of X\ between 
Xi and j 2 . If the strengthened condition of Legendre holds, there exists a set of t 
natural isoperimetric conditions 

AUn ) = 0 (3=1, ••• ,0 

such that the inequality J(v) si 0 is true for every admissible arc satisfying 
these conditions, the equality holding only in case m is an accessory extremal satis¬ 
fying the transversality conditions (12.3). Moreover, the admissible arcs are 
such that the inequality 

AU St)aaa, <0 (o, fi - 1, ... ,0 

is true for every set (a) ^ (0). 

We have the following immediate corollaries. We assume that the strength¬ 
ened condition of Legendre holds. 

Corollary 1. The type number m of J is equal to the sum of the orders of the 
focal points of i x between x x and Xt. The order of degeneracy d of ./ is equal to 
the order of Xt as a focal point of x x . 
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In the following corollary and elsewhere we count a focal point (or a conju¬ 
gate point) a number of times equal to its order. 

Corollary 2. The number of focal points of x, between x, and x 2 is equal to 
the number of conjugate points of x, between x , and x 2 plus the order of concavity 
of J. To obtain the number of focal points of x, on x, < x ^ x 2 , we add to this 
sum the order of degeneracy of J . 

This result follows from Corollary 1 of Theorem 10.1. 

We may call the order of concavity r of J the order of concavity on XiX 2 of 
the conjugate system n* determined by J. Since every conjugate system deter¬ 
mines a functional J by virtue of Lemma 14.2, we have the following further 

results. , . , . . 

Corollary 3. Let t, t * and r , r* denote respectively the number of focal points 
and the order of concavity of two conjugate systems 17 ,*, v ** on an open interval 
X 1 X 3 . Then 

i - - r - r* 

Further results of this nature will be given in the next section. 

We shall now give a convenient method of finding the orders of concavity 
and degeneracy of J. To do so, let 17 ,* be the conjugate system determined by 
the conditions (12.4) and let u ik be the conjugate system having n,*(x 2 ) = 0. 
Let {*<*, Vik be respectively the values of ay. along the extremals * 70 , u We 
have the following preliminary result: 

Lemma 14.3. If h is the rank of the matrix 

(14.5) — rn,vu II, 

then n — his equal to the order of degeneracy of J , that is, the order of x 2 as a focal 
point of Xi. 

For since the elements of this matrix (14.5) are all constants, we may evaluate 
them at x = x 2 . It is found then that the rank of this matrix is equal to that 
of the matrix || » 7 ,*(x 2 ) ||, since at x = x 2 we have u ik = 0 and | | ^ 0 . 

The lemma now follows from Lemma 14.1 and Corollary 1. 

The following theorem gives a method of computing the order of concavity 
of J. We assume, of course, that the strengthened condition of Legendre holds. 

Theorem 14.2. Let x 3 be a value such that there are no focal points of X\ or 
conjugate points of x 2 on the interval Xi < x ^ x 3 . If » 7 ,*(x 3 ) = «.*(* 3 ), the 
order of concavity and the order of degeneracy of J are equal respectively to the 
negative type number and the nullity of the quadratic form 

(14.6) - 1 uftik)afl k (», j, k = 1, ••• , n). 

To prove this let 

(14.7) /(«#,*) =0 tf = 1, ••• ,s) 

be a proper minimal set for the fixed end point case. We may assume that 
the functions !,-* are identically zero on the interval X 1 X 3 . Let f),> (<r = 1, • • • , r) 
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be a maximal set of admissible variations satisfying the conditions (14.7) and 
having 

(14.$) i\ 9 )b,b r <0 (a, t = 1, • • • , r) 

for every set of constants (6) ^ (0). Suppose the functions ija, have been 
chosen so that 


If now we set 


1 u(z t ) = i7i.(zi) = Uu{x j) {a = 1, • • • , r). 


u* = V<* (*i £ x g i,), = Ui, (x, £ x £ xt), 

it is clear that the inequality (14.8) is still true if we replace the functions 
*J<# by since the set (14.7) forms a minimal set for the fixed end point case. 
Moreover, by the usual integration by parts and the use of the transversality 
conditions it is found that along the arc = t2*6, we have 

J(ri) ~ 2q -I 2« dx + | 2« dx 

J*x Jm , 

- rti(x* - 0)f<(xa - 0) - i»i(xa + 0)f<(x s + 0) 

* Vi(x» + 0)J\(xa — 0) — i?<(xa — 0)f,(x # + 0) 

= (f<#u< r - m,v {r )b,b r (<r, r - 1, ... , r). 

One readily verifies that every broken extremal satisfying the end conditions 
(14.2) and having its comers at x = x a also satisfies the conditions (14.7). It 
follows that the negative type number of the quadratic form (14.6) is equal to r. 
The nullity of this form is clearly equal to n - h, where h is the rank of matrix 
(14.5), and the theorem is proved. 


15. The case of two end manifolds. Consider now the functional 

•t 


05.1) 


2$.(».) 


- 2 + [' 
J* i 


2a>(ij, rj 1 ) dx, 


where 2q x and 2q% are symmetric quadratic forms in the variables in and rj a , 
respectively. The end conditions are assumed to be of the form 


(15.2) 


**»0li) = a^iii(xi) = 0 (m = 1, • • • , Pi £ n), 

*,a0n) = b p i ij«(xj) = 0 (p = 1, ••• , pj £ n). 

In this case the transversality conditions are expressible in the form 

Tn(vi, h) = fli*,, + — «*;(*i) = 0, 

T*(vt, It) = 7a, lt + Utbpi - «,;(xj) = 0, 
as one readily verifies. 
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The problem determined by the system (15.1) and (15.2) will be called the 
problem P. The problems obtained by replacing the end conditions ¥ r * = 0 
by r]i(xt) = 0 will be called the problem Pi. By the problem P > will be meant 
the problem obtained by replacing the end conditions ¥,» = 0 by the condi¬ 
tions in(x i) = 0. The problems Pi and P 2 are of the type described in the 
last section. By the use of Theorem 11.1 we obtain the following important 
Theorem 15.1. Let m, m x , m* be respectively the type numbers of the problems 
P, Pi, Pt, and let q x , q 2 be the orders of concavity of P with respect to P x and P 2 . 
If the strengthened condition of Legendre holds, then 

(15.3) m = m, + q x = m, + q t . 

The transversality conditions and the end conditions for the problem Pi are 
given by the system 

(15.4) *piO»i)-0, T ix ( Vx ,l x ) = 0, 

and for the problem P 2 by the system 

(15.5) ¥,sOvt) = 0, Tit ( rit , lt ) =* 0. 

Let v\k> v]k be the conjugate systems of accessory extremals determined by 
(15.4), (15.5) respectively. It will be convenient to denote these systems by 
Hi and Hi. According to Lemma 14.2 every pair of conjugate systems Hi, H t 
determine a pair of problems Pi, P«, which are obviously sub-problems of a 
problem P defined by a functional of the type (15.1) subject to end conditions 
of the form (15.2). It follows that the general theory concerning pairs of con¬ 
jugate systems is obtained by the study of the problem P. 

Let f i be the value of w,- along the arc ij,. It is well-known that if an ex¬ 
tremal satisfies the conditions 

UiVi — *</{■< = 0 , 

where rm is a conjugate system, then v% is linearly dependent with constant 
coefficients on ij,,-. Hence we have the following result. 

Lemma 15.1. If h is the rank of the matrix 

llri,A - ill 

the order of degeneracy of J is equal to n — h. 

Thus we see that the order of degeneracy of J is equal to the maximum 
number of linearly independent extremals which the systems H x , Hi have in 
common. 

The zero of the determinant | ij,*(x) | will be called a focal point of the 
conjugate system r, ik and its order will be defined to be n - h, where h is the 
rank of the matrix || ruk(x») ||. A focal point will be counted a number of 
times equal to its order. 

Suppose now that we have given two conjugate systems //,, H 2) as above. 
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Let d denote the maximum number of linearly independent extremals which 
these systems have in common. Let Xj be a value of x which is not a focal 
point of either H x or Ht and choose the functions v !*, *?!* belonging to Hx, H 2 
respectively so that v\ k = at x = x a . Let q{x t ) be the negative type num¬ 
ber of the quadratic form 

(15.6) t = 1, ■ • • , n). 
The nullity of this quadratic form is clearly equal to d. The function q(x) is 
well defined except at focal point x» of // x or Ht. At such a point we define 
q(x i - 0), q(x j + 0) to be the left hand and right hand limits of q(x). It is 
clear that q(x) is a constant on any interval which contains no focal point of 
either //, or //*. The function q{x) will be called the index function of H x with 
respect to H t . The index function of //* with respect to H x is clearly the 
function n - d - q{x). It follows that <7(1) 3 n — d. 

Suppose that Hx, Ht belong to the problems P x , Pt, P described in the para¬ 
graph preceding Theorem 15.1. We have the following result. 

Lemma 15.2. The number q{x x + 0) is equal to the order of concavity of the 
problem P with respect to the sub-problem Pt. The number q{x * — 0) is equal to 
the order of concavity of P with respect to P x . 

The proof of the first statement of the theorem is precisely that of Theorem 
14.2 except for obvious changes in notation. In this case we choose the set 

(14.7) to be a proper minimal set for the problem Pt such that the functions 
are identically zero on an interval x x < x £ x a which contains no focal points 
of Xi or x 2 . If we replace the conjugate system rju, u ik by »?<*, it is found 
that q(x j) = 9(11 + 0) is equal to the order of concavity of P with respect to 
Pt. The last statement follows similarly. 

The following important result can now be proved. 

Theorem 15.2. Let H Xt //* be two conjugate systems and q(x) be the index 
function of H x with respect to /It. Let t x , It denote the number of focal points of 
H x , Ht, respectively, on an interval y. Then 

t\ - tt = q{x x + 0 ) - q(xt - 0 ) on (xi < z < it), 

h - ti - q(x 1 + 0 ) - q(xt + 0 ) on (z» < x ^ x a ), 

t\ - tt = q{x 1 - 0) - q{xt - 0) on (xi ^ x < it), 

tx - tt = q{x x - 0 ) - q(xt + 0 ) on (x x ^ x ^ z 2 ). 

In each of these cases \tx - tt \ £ n — d, where d is the maximum number of 
linearly independent extremals belonging to both H x and //?. 

For, the systems H x , Ht determine problems P, Px, Pt of the type described 
above. The first of the relations (15.6) follows at once from Lemma 15.2 and 
the relations (15.3) since m t = t lf m t = It in this case. The remaining rela¬ 
tions follow at once from the first by increasing x a , or decreasing X\, slightly. 
The last statement is a consequence of the fact that 0 ^ g(x) ^ n — d, as was 
seen above. 
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Thus we see that the equations (15.6) give us a method of finding the number 
of focal points of a conjugate system H x on an interval y when the number of 
focal points of a second system H , are known. In particular, if the number of 
conjugate points of x x on x x x t is known, the number of focal points of any con¬ 
jugate system ij ik can be computed by the use of equations (15.7). 

The quadratic form (15.6) has appeared in the literature in many different 
forms. It has been used effectively in the two dimensional case by Bliss, 1 * 
and in the n-dimensional case by Morse, 1 ’ Currier ,* 0 and Hestenes .* 1 This quad¬ 
ratic form has been used by Morse to obtain the inequality | t x — h | ^ n — d 
and other stronger inequalities.” The relations (15.7) were first given by 
Currier .* 0 

16. Periodic extremals. Suppose now that the strengthened condition of 
Legendre holds and that there exists a constant r and a set of n* constants A 
with A = | A a | ^ 0, such that the equation 

(16.1) 2w(x, 17,17') - 2 o>(z + r, ti, u') 
is an identity in x, ru, 17 J, where 

Ui — Aiilll, Ui m: Aiiv'j. 

It follows that the expression 

(16.2) Li(x, 17, v\ v") = AuLiix + r, u, u' t ti"), 
where 

Li = w,. - (d/dt)<au[ = A ,,77", 

is an identity in x, 17,, rj'i, Vi- The identities (16.1) and (16.2) are still valid if 
we replace r by or and A a by A * %i , where a is an integer and A are the ele¬ 
ments of the matrix A *. 

Lemma 16.1. If 17a is a conjugate system of accessory extremals and 

U ik (x + or) = i4*y rjik(x ), 

the functions u,*(x) define a conjugate system. Moreover , the number of focal 
points of the conjugate system u ik on the interval i, + or < 1 g i 2 + ar is equal 
to the number of focal points of rj ik on x x < x ^ x,. 

•• Bliss, A boundary value problem of the Calculus of Variations, Bulletin of the American 
Mathematical Society, vol. 32 (1926), pp. 317-331. 

** Morse, pp. 65-70, 106. See also A generalization of the Sturm separation and com¬ 
parison theorems in n-space, Mathematische Annalen, vol. 103 (1930), pp. 52-69. 

*• Currier, The variable end point problem of the Calculus of Variations including a 
generalization of the classical Jacobi condition, Transactions of the American Mathematical 
Society, vol. 34 (1932), pp. 68^-704. 

11 Hestenes, Sufficient conditions for the problem of Bolza in the Calculus of Variations, 
Transactions of the American Mathematical Society, vol. 36 (1934), pp. 793-818. 

” In this connection one should also see Hu, loo. cit. 
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The functions u ik (k = 1, ••• , n) are clearly extremals by virtue of the 
identity (16.2) with r replaced by or and A { , by A m u . If a similar replacement 
is made in equations (16.1), by differentiating with respect to v< it is found that 

“ 0 . 

If we denote the values of «,• along va, u» by fa, t>a, it is clear from this last 
expression that 

Vufa)Si k fa) = + ar)vik(z + ar), 

and hence that 

0 = mAik — Virtu = Ui,Vik — uat»</. 

The extremals t*a accordingly form a conjugate system. The last statement in 
the lemma is immediate. 

An extremal Vi will be said to be periodic if there exists an integer a such that 
(16.3) Vi(* + «r) = A^Vi Or), =* + <")• 

If a is the smallest integer for which this condition holds, then v< will be said 
to be of period ax. It is clear that if the relation (16.3) hold for one value of x 
it holds for all values of x by virtue of the identities (16.1) and (16.2). 

Let v%k be a conjugate system and Ufa ) be the number of focal points of i Uk 
on the interval x x < x £ X \ + hr . Let d „ be the maximum number of periodic 
extremals which are linearly dependent on the system if a and whose periods 
are divisors of hr . We have the following*’ 

Theorem 16.1. The limit 



exists, is independent of x x , and is the same for all conjugate systems v «• More¬ 
over, the inequalities 

(16.5) I hp - Ufa) | g n - d* (h - 1, 2, ... ) 

are true for all values of Xu If Vafa) = 0, l^ 1 

(16.6) Ufa) £ hp £ Ufa) + n - d h (h = 1,2, — ). 
For, let a = hk and 

u ik (x -f ar) = A^virtx). 

*» Cf. Morse and Pitcher, On certain invariant* of dosed extremals, Proceedings of the 
National Academy of Sciences, vol. 20 (1934), pp. 282-287. See also Hedlund, Poincares 
rotation number and Morse's type number, Transactions of the American Mathematical 
Society, vol. 34 (1932), pp. 75-97. 
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The number of focal points of uu(z) on x x + (a — A)t < x £ h -f ar is equal 
to th(xi), by Lemma 16.1. According to Theorem 15.2 the number of focal 
points of n,k on this interval differs from t k {x x ) by at most n - d k . This 
statement holds for all values of the integer k in a = hk and hence 

| ta(x ,) - kt k (x x ) | ^ k(n - d k ), 

I t a (x i) - ht k (x x ) | £ h(n - d k ). 


If now we divide by a = hk, we find that 


(16.7) 


and hence that 


0 ^ 
0 £ 


Oft) 

U(x x ) 

a 

h 

«.(*.) 

t k (x x ) 

a 

k 

1 <.(*.) 

tk(x 1 ) 

h 

k 


£ (n - d k )/h £ n/h, 
£ (n - d k )/k £ n/k, 

£ n/h + n/k. 


These relations hold for all values of x x . For h, k sufficiently large, the right 
member in the last expression can be made as small as we please. It follows 
that the limit n in (16.4) exists and is the same for all values of x x . Since the 
numbers t k {x x ) for two different conjugate systems differ by at most n, it is 
clear that the constant ** is the same for all conjugate systems rj ik . The rela¬ 
tion (16.5) follows from the first of the relations (16.7) by letting the integer k 
in a = hk become infinite. If vn(x x ) = 0, the expressions (16.7) are still valid 
if the absolute value signs are removed. The relation (16.6) is then obtained 
by letting k become infinite again. This proves Theorem 16.1. 

The constant /i will be called the frequency number: 

Corollary. If there exists a conjugate system 17a all of whose extremals are 
periodic, then the frequency number n is rational}* 

For, in this case the constant d k appearing in (16.5) is equal to n for a suitably 
chosen integer h. For this value of h we have h» = * A (xi). It follows that n 
must be rational. 


** By the use of a theorem of Birkhoff on linear differential equations with periodic 
coefficients C Dynamical Systems, pp. 77-*9) it can be shown immediately that all conjugate 
points of z, satisfy an equation A(x, x x ) = 0, where in the general case A(x,, x) = A(x, x,) 
is a linear homogeneous expression of the form 

2p(z) ******** + ■” ±Xj* 

p(x) is periodic in x of period r and Xi, • • • f A, are n quantities, real or complex. By 
means of this form complete results on the asymptotic distribution of the conjugate points 
can be obtained. Results of this sort have been announced by Morse and Pitcher (loc. cit.) 
without indication of the method employed. 
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III 

The general fixed end point problem in non-parametric form 

In §§17-21 we shall be concerned with the characterization of extremals of 
an integral of the form 

' = /> »!&»•••*?■»?»»•••» = /.> , y, y') dx 

in (xi, y,, • • • , y„)-space. Among other things, it will be shown that if there 
are m conjugate points of the initial point 1 on an extremal arc Ext, there exists 
a set of m natural isoperimetric conditions (and no fewer) such that Ext affords 
a minimum to J relative to neighboring admissible arcs joining the end points 
of Em and satisfying these m natural isoperimetric conditions, provided, of 
course, that the strengthened conditions of Weicrstrass and Legendre hold on 
Ext and the point 2 is not conjugate to 1. The strengthened condition of 
Weierstrass here used is a modification of the classical one and reduces to the 
latter in case there are no conjugate points of the point 1 on Ext. This result 
is established without the use of the classical isoperimetric theory. This is 
possible only because of the special properties of natural isoperimetric conditions. 

17. Hypotheses and preliminary results. It is assumed that the function 
/(x, y, y') is of class C 4 in an open region 9? of points (x, y, y'). A set of 
elements (x, y, y') will be called admissible if it is in 91. By an admissible arc 
will be meant an arc 

- y»W fa £ x £ xi, i - !,•••, n) 

of class D' t all of whose elements (x, y, y') are in % and joining two fixed points 
(x>, yi i) and (x*, y,j). 

An arc of class C" with elements (x, y, y') in 9? which satisfies the Euler 
equations 

(17.1) Pi = f Wi - (d/dx)/„j = 0 (t = 1, • • • , n) 

will be called an extremal arc. Such an arc will be said to be non-singular if 
the determinant 

R = M .* I 


is different from zero at each element (x, y, y') on it. It is well-known that a 
non-singular extremal arc Ext is a member of a 2n-parameter family of ex¬ 
tremal arcs 

(17.2) yi = y.foci, ••• ,ctn) 
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for special values x x ^ x ^ x t , c , = c*. The functions yi(x t c), y is (x, c) are of 
class C 3 in a neighborhood of the values (z, c) on Em and the determinant 


Vic. 


(s — 1» • • • * 2 n) 


is different from zero on E xt . 

Along an extremal arc Em the second variation of the integral J is expressible 
in the form 

JM = J 2 u(x, rj, 17 0 dx, 


where 

2" = /v,v* n.*7* + 2 / y Vi v[ + i ?i v'k . 

The variations vM which we admit are of class D' and vanish at x» and x t . 
Thus we see that the study of the second variations leads us to a problem of 
the type described in §§ 1 - 6 . 

A point 3 on Em will be said to be conjugate to the point 1 on Em if it is defined 
by a value x 3 conjugate to x x in the sense described in §1. By the order of 
the point 3 as a conjugate point of 1 will be meant the order of x 3 as a con¬ 
jugate point of x x . It is well-known that the conjugate points of h are defined 
by the zeros x 3 x x of the determinant 

yu.(x, Co) 
yu.(x 1, Co) 

belonging to the family (17.2). This result follows because the functions 
Vu = c) form a set of 2 n linearly independent accessory extremals for the 
functional J 3 (v)- An important consequence of this fact is that if the point 2 
on Em is not conjugate to the point 1 , then every pair of points which lie re¬ 
spectively in a sufficiently small neighborhood of the end points of £12 can be 
joined by a unique extremal arc. The conjugate points of the point 1 are also 
determined by the zero x 3 j* x x of the determinant 

I »«*(*, fro) I 

belonging to an n-parameter family of extremals 

Vi = yi(x, 61, ... , 6.) 

passing through the point 1 on Em, containing Em for the values h ^ x ^ x t , 
b { = bn, and having its determinant | y itbk | different from zero at the point 1 
on Em. The continuity properties of this family are, of course, the same as 
those of the family (17.2). This result follows from the remarks made in §1 
since in this case the functions Vxtfx) = y^ A (x, b) form a set of n linearly inde¬ 
pendent accessory extremals having tj,*(z,) = 0. 
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18. Natural isoperimetric conditions. Let £*(x) be a function of class C l on 
X\Xt and vanishing at x x and x*. The condition 

«/i(*) « f" Uit<+ /<*'<I *-0 
J*\ 

will be called a natural isoperimetric condition. This condition is satisfied by 
every arc which satisfies the equations 

(18.1) S K = [‘/.tdz+c, (*- 1, •••,*») 

with a set of constants Ci, and hence also by every extremal arc. This fact is 
readily established by integration by parts. Conversely, an admissible arc 
which satisfies all natural isoperimetric conditions is necessarily a solution of 
equations (18.1). This result follows from the fundamental lemma in the Cal¬ 
culus of Variations, since in this case J x (it) = 0 for every admissible arc iu of 
class C* and hence also for those of class D'. 

It is clear that the function $<(x) used in the definition of natural isoperimetric 
conditions can be replaced by functions $»(x, y) which vanish at the points 
1 and 2. However, for the present purposes, we shall restrict ourselves to 
functions of the first type. 

Consider now a set of m natural isoperimetric conditions 


(18.2) 

/>«.) - \f.dx - 0 

J* i 


■ ,m), 

where 

fa - Siafti + 



If we set 

y, y\ x) - / + x./., 




the Euler-Lagrange equations of the integral J relative to the conditions (18.2) 
take the interesting form 

(18.3) P H - (d/dx)F,. = P % + KUS.) - 0, 


where P, are the functions (17.1) and 


An arc 


Li(l) = w,. - (d/dx) w, ; . 


y< = Ifito 


(x, ^ x ^ x,), 


which satisfies these equations with a set of constant multipliers X a will be called 
an isoperimetric extremal. Along such an arc we have by the usual integration 
by parts 

(18.4) /,({„) = J St.P> dx — — J‘ 2 1 ..MWX, dx = WX/i, 
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where «/*(£, v) is the bilinear functional associated with the quadratic functional 
as described in §2. It follows that if the determinant 

(18.5) I /*(*., h) | 

is different from zero along an isoperimetric extremal E xt) this arc can satisfy 
the conditions (18.2) only in case its multipliers X« are all zero, that is, only in 
case Eh is an extremal of the functional J. 

It should be noted that if the determinant 



is different from zero along an isoperimetric extremal Eh, then E\% is a mem¬ 
ber of a (2n + m )-parameter family of isoperimetric extremals 

( 18 . 6 ) yi = y.(z, c,, • •. , c,*, A», ... , X.) 

with multipliers Xi, • • • , X„. The functions y,(z, c, X), y u {x, c, X) are of 
class C" in a neighborhood of the values (z, c, X) belonging to E u and the deter¬ 
minant 

I V*, 

\Vi.u 

is different from zero along E X y It is clear that for fixed values of (X) the family 
(18.6) defines the usual extremal family belonging to the integral 

A- I.” Fdx. 

We shall make use of this fact in our sufficiency proofs. If the isoperimetric 
extremal E X i satisfies the conditions (18.2) and is such that the determinant 
(18.5) is different from zero on it, then E X i is a member of a 2n-parameter family 
of isoperimetric extremals satisfying the conditions (18.2) and this family is 
precisely the extremal family (17.2) for the functional J, as one readily verifies. 

19. Minimal sets. A set of m natural isoperimetric conditions 

( 191 ) = / /. dx - 0 (a - 1, ... 9 m) 

will be said to form a minimal set for an admissible arc E X t satisfying these con¬ 
ditions if the arc Eh affords at least a weak minimum to the integral J relative 
to neighboring admissible arcs satisfying the conditions (19.1), and if no proper 
subset of these conditions has this property. Such a set will be called a proper 
set if the determinant 

< A *» I U) I 

is different from zero on E Xi . In the following pages we shall restrict ourselves 
to proper minimal sets unless otherwise expressly stated. In this case there 
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are infinitely many admissible arcs joining the end points of Em and satisfying 
the conditions (19.1). This result follows from the proof of the following 
lemma: 

Lemma 19.1. If the set (19.1) forms a proper set for an admissible arc Em and 
rji(x) is a set of admissible variations for Em satisfying the conditions 

(19.3) /*(*., n) - 0 (a = 1, • • • , m) 

on Em, there exists a one-parameter family of admissible arcs 

(19.4) yi = yjx, 6) (z, ^ z g xf) 

satisfying the conditions (19.1), containing Em for 6 = 0, and having ij< = y*(x, 0). 
The further derivatives y,*, i/m, y,M exist and are continuous except possibly at a 
finite number of values of x on xxt. 

To prove this, let iu(x) be an arbitrary admissible variation which satisfies 
the conditions (19.3) along Em and let 

Yi(x, at, " ’ , a*, 6) - y,(x) + a.f.(z) + bgfx ), 

the functions y t (x) being those belonging to Em- When these functions are 
substituted for in the equations (19.1), a set of m functions Ji a (a, b) having 
J la (0, 0) = 0 is obtained. The determinant of the derivatives of J\ a with 
respect to a h • • • , a m at- (a, 6) = (0, 0) is equal to the determinant (19.2) and 
is accordingly different from zero on Em- The equations Ji«(a, 6) = 0 there¬ 
fore have a unique solution a, = AJb) of class C l near 6 = 0 and having 
i4«(0) = 0. Moreover 41(0) = 0. This is also a consequence of the non¬ 
vanishing of the determinant (19.2) since in this case we have 

(d/<ft)y,.U(6) l 6)|w. = + J,(i.,v) 

= /»(«., 6.) x;(o) = 0 , 

as one readily verifies. It is clear, therefore, that the one-parameter family 

Vi = y<(x, b) = Y ,(z, -4(6), 6] 

satisfies equations (19.1), contains Em for 6 = 0, and has i?,(z) = t/*(z, 0). 
This proves Lemma 19.1. 

If now we substitute the family (19.4) in the integral J, a function J{b) of 
class C" is obtained. We must have Jiff) ^ Jiff), if the set (19.4) is to form a 
minimal set for Em- Hence we have 

(19.5) J'iff) - JxW = ['* I/„*« + f,-Vi\ dx = 0 

along Em for every set of variations satisfying the conditions (19.3). It 
follows that the determinant 
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JiM 

Jt(Za, V’) 

vanishes for every set of m + 1 variations 17 ,, (v = 1, • • • , m -f- 1).“ There 
exists therefore a set of m + 1 constants Xo, X« not all zero such that 

WiO?) + 17 ) = 0 

for every set of admissible variations 17 ,-. If now we substitute for 77 * the varia¬ 
tions it is found that 

X^,({„ fc) = 0, 

and hence that the multipliers X, are all zero, since the set (19.1) forms a proper 
set. We may accordingly choose Xo = 1. The expression (19.5) must there¬ 
fore hold for all admissible variations whatever. By the use of the funda¬ 
mental lemma in the Calculus of Variations we obtain the following result: 

Theorem 19.1. If the set (19.1) forme a proper minimal set for an admissible 
arc Eh, there exists a set of constants c< not all zero such that the equations 

(19.6) = \ f vi dx + Ci 
hold at each point on E lt . 

If the set (19.1) is a minimal set but not a proper minimal set, the function / 
in (19.6) must be replaced by a function of the form F = X 0 / + X«/ a . This 
result follows from the theory of isoperimetric problems. 

An admissible arc Bit will be said to satisfy the condition of Weierstrass if at 
each element (x, y, y') on it the inequality 

E(x, y, y\ J") 0 

for every admissible element (x, y, Y') (x, y, y'), where 

(19.7) E = f(x, y, Y’) - f(x, y, y') - (Y\ - y\)f^x, y, y'). 

We have the following result: 

Theorem 19.2. Suppose that the conditions (19.1) form a proper minimal set 
for an admissible arc E XJ . If E \ s affords a strong minimum relatively to neighbor¬ 
ing admissible arcs satisfying these conditions, then £„ satisfies the condition of 
Weierstrass. 

In order to obtain this result, let the point 3 be an arbitrary point on E lt , 
not a corner point or an end point of E lt . Let K', be an arbitrary direction and 
Y »(x) be an arbitrary curve of class C passing through the point 3 and having 
Y*( x *) — y .3- The functions defining E xi will be denoted by y,(x). We now 

“ Cf - Blis, s, The problem of Lagrange in the Calculus of Variations, American Journal of 
Mathematics, vol. 52 (1930), pp. 682 , 692. 
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construct a one-parameter family of arcs y,(z, b) joining the end points of E » by 
setting 

Vi (x, b) = y*(z) (z, ^ x ^ it) , 

Vt(x,b) = Yi(x ) (zi ^ x £ b), 

S,(x, b ) « »,(i) + (K,(t) - y,(6)l (b£x£x,). 

) — It 

This family is of the type usually used in the proof of the necessity of the 
Weierstrass condition for a minimum in the classical sense. It has the follow¬ 
ing important properties 

y*(z, z,) s 0 (z, £ z £ x,), y*(x t , b) = 0, 

(19.8) 

y*(x # + 0 , z a + 0 ) = Yi(xt) - y<(zj). 

However, this family does not necessarily satisfy the conditions (19.1). In 
order to construct the one-parameter family of admissible arcs which satisfies 
these conditions, we substitute the functions 

(19.9) y<(z, 6) + a.|<.(z), y<,(z, 6) + a. a (z) 

for y<, y ■ in Ji({.) and obtain a set of functiohs Ji.(a, 6) having J i«(0, Zi) = 0. 
The functional determinant of these functions with respect to the variables 
a,, • • • , a m is precisely the determinant (19.2) and is accordingly different from 
zero. It follows that the equations J»«(a, 6) = 0 have a unique solution 
a„ = 4«(6) of class C ' for 6 £ x> and sufficiently near 6 = x». Moreover 
■/la(xa) “ 0. The one-parameter family 

(19.10) y< - yi(x, b) + 

therefore satisfies equations (19.1) for 6 ^ z a and sufficiently near 6 = z 2 , as 
was desired. 

By substituting the functions (19.9) for y, and y • in the integral J one ob¬ 
tains a function J(a, 6). Recalling that the family (19.10) satisfies the condi¬ 
tions (19.1), one sees that 

JlA(b), b] Z J(z,) (6 £*,), 

since the 6et (19.1) forms a minimal set for Eu. Hence we must have 

(19.11) (d/db)J[A{b), &]|b_o = J.A 0, z>)ii'(0) + A(0, *.) & 0. 

By differentiating the function J(a, b) with respect to a a it is found that 

(19.12) /-.(0,x,) = /,«.) = 0. 

Moreover, if we write 

J(0,b) = £/<** + 
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by differentiating this expression with respect to b and setting b = x 3 we obtain 
from relations (19.8) and the usual integration by parts the relation 

MO, x,) = /( x„ y„ Y',) - f{x, y, y') - f‘‘ + / ,„' k | dx 

= E(x t , y h y' lt Y’ t ). 

Combining this result with the relations (19.12) and (19.11) we find that the 
theorem is true for all points on E xt except possibly at the corners and end points 
of Eu. That the theorem is also true at these points follows at once from con¬ 
tinuity considerations. 

An admissible arc E X1 will be said to satisfy the condition of Legendre if at 
each element (x, y, y') on it the inequality 

holds for every set (*■) ^ (0). If this inequality holds with the equality sign 
excluded, then the strengthened condition of Legendre will be said to hold on E xi . 

Theorem 19.3. If the conditions (19.1) form a proper minimal set for an 
admissible arc E xi , then the condition of Legendre holds on E xt . If E xi is non¬ 
singular, then the strengthened condition of Legendre holds on E i7 . 

To prove this, we note that according to the proof of the last theorem the 
function 

*>(6) = E(x, y, y’, y' + br), 

where the elements (x, y, y') are on E„ and (t) is arbitrary, must have a mini¬ 
mum at 6 = 0. It follows that *>'(0) - 0, t"(0) = g 0, as was to be 

proved. This simple proof has been used by Morse. 

Theorems 19.3 and 19.4 give a new and broader significance to the conditions 
of Weierstrass and Legendre in the sense that they are not only necessary con¬ 
ditions for a minimum in the classical sense but also in the more general sense 
here defined. 

We have the further result 

Theorem 19.4. If the set (19.1) forms a proper minimal set for an extremal 
arc Ei 2 , the inequality J 2 (v) £ 0 is true for every admissible variation ru(x) satis¬ 
fying the m conditions •/,(£., v ) = 0 along E xi . 

This result is obtained by differentiating the function J(b ) appearing in the 
proof of Theorem 19.1 twice for b and noting that /"(0) = J t ( v ) must be posi¬ 
tive or zero if the set (19.1) is to form a minimal set for E x2 . 

20. Mayer fields and an analogue of Hahn’s Lemma. Consider now an 
n-parameter family of extremals 

( 20 - 1 ) yi = y.Cr, <*!,••*, a n ) 

whose functions yi (x, a), y^x, a) are of class C" in a region of points (x, a) with 
x on x x £ x £ Xi and a, in A. Suppose further that the determinant | y iok | is 
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different from zero within this region and that this region has been chosen so 
small that the equations (20.1) have unique solutions a< = a,(x, y ) of class C" 
for all Or, y) in a region g in xy-space. The family (20.1) is then said to simply 
cover the region g. Let 

Pi(z,y) = ViAx,a(x,y)\. 

The region g will be called a Mayer field if the Hilbert integral 

J* = J f(x, y,p) dx + (dyi - p, <&)/,;(*, y, p) 

with pi = p,(x, y) is independent of the path in g. The functions p*(x, y) are 
called the slope functions of the field and the solutions (20.1) of the equations 
y 'i = p%(x, y) are called the extremals of the field g. Along an extremal of the 
field we have clearly J • = J. 

The following theorem is well-known. In this theorem the function 
E(x, y, p, y') is the Weierstrass ^-function defined by the equation (19.7). 

Theorem 20.1. If Ext is an extremal of a field g at each point of which the 
condition 

Elx, y, p(x, y), y'] > 0 

holds for every admissible set ( x, y, y') j* (x, y, p), then the inequality 
J(Cit) > J(Eit) is true for every admissible arc Cm in g joining the end points 
of Em but not identical with Ext. 

This result follows in the usual manner by the use of the Weierstrass integral 
formula 

J(Cxs) - AExt) = [ E(x, y, p, y') dx, 

>»i 

which is readily established by the use of the equations 

J(Ext) = J*(Ext) - J*(C„). 

We shall need the following lemma: 

Lemma 20.1. If the Hilbert integral J* formed for the family (20.1) is inde¬ 
pendent of the path on the hyperplane x = Xo, the family (20.1) defines a Mayer 
field over every region g which it simply covers. 

This result is well-known and is due to Hilbert. 24 
A lemma of a different type is the following one: 

Lemma 20.2. If a non-singular extremal Em has on it no point 3 conjugate to 1, 
there exists a conjugate system of accessory extremals 17 ,* for Em having its deter¬ 
minant different from zero on Em- 

This result and its proof are also well known. One needs only to take the 


*• Gottinger Nachrichten, (1905), p. 165; Math. Annalen, 62 (1906), p. 356; see also Bliss, 
loc. cit., p. 733. 
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conjugate system having ij<* = 0 at a value xq to the left of x, and sufficiently 
near X \. 

Consider now a proper set of m natural isoperimetric conditions 

(20.2) Ml.) = Jy. dx = o (a = 1, ■ • ■ , m), 

and let F = / -f X„/ a . The Weierstrass ^-function formed for the function F 
will be denoted by Ex(x, y, y\ Y'). It is precisely the ^-function for the integral 

Jx = f F{x, y, y', X) dx. 

An extremal arc Em for our original integral J will be said to satisfy the 
strengthened condition of IFezers/ross relative to the conditions (20.2) if at each 
element (x, y, y') in a neighborhood 0? of those on Em the inequality 

Ex(x, y, y\ TO > 0 

holds for every admissible set (r, y, Y') * (x, y, y') and for every set of con¬ 
stants (X.) in a neighborhood A of (X) = (0). If there are no conditions of 
the type ( 20 . 2 ), this condition becomes the usual strengthened condition of 
Weierstrass. 

We have the following analogue of Hahn's Lemma." The proof here given 
is like that of Bliss. 2 * 

Lemma 20.3. Let Em be an extremal arc having on it no point conjugate to its 
initial end point 1. If the strengthened conditions of Weierstrass and Legendre 
hold on Em, there exists a neighborhood g of Em in xy-space and a neighborhood 
N of the end points of Em in (x i y l x 2 y i )-space such that every isoperimetric extremal 
E x in 5 with end points in N and having Us multipliers (X) sufficiently near 
(X) =* (0) affords a proper strong minimum to J x relative to admissible arcs in $ 
joining the end points of Ex. 

To prove this let r, ik be a conjugate system of accessory extremals for Em 
having its determinant Vik (x) different from zero on x,x,. Such a conjugate 
system surely exists by virtue of Lemma 20.2, since in this case there is no 
point on Em conjugate to the point 1. Let f a be the values of along Vik 

on Em. We may suppose that the functions i u k have been chosen so that 
Vue = Sa at x = Z|. 

Consider now the (2 n + m)-parameter family of isoperimetric extremals 
V* = fl l> • • • ) °n, bi , ... , b n , Xi, • • • , Xj 

described in §18. The constants a„ 6,- may be chosen to be the values of y„ 


^ Hahn Ober Variationsprobleme mil variablen Endpunklen, Monatshefte fur Mathe- 
matik und Physik, vol. 22 (1911), pp. 127-136. 

vor^^pTlM B ° ha ' n ^ CaUuluS ° f Vari< - ions > Annals of Mathematics, 
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and Zi = + \ofay' at x = X\. Let a,v, 6 l0 , X a = 0 be the values of (a, b, X) 

belonging to Em and let 

2 w(a) = f,*(xi)(a i - a*) (a* - a w ). 

The family 

(20.3) y. = y»(x, a, 6 , X) = K,(x, a, 6 + u>*, X) 
contains for a, = a, 0 , &i = 6 l0 , X. = 0, and has 

(20.4) yw,*(xi, a, b, X) = 6 ik = i»,t(x,), Zi-*(xi, a, b, X) = fu(xj), 

where z, = Along Em the functions yu k form a set of n accessory extremals 
for which z iak represents the values of «,». Hence we must have y** = r) ik along 
£*11 since they have the same initial conditions at x = x,, by virtue of the 
equations (20.4). The determinant | y imk | is accordingly different from zero 
along Em and hence also for all values of (x, y, b, X) near those on Em. The 
equations (20.3) therefore have a unique solution a { = < 2 j(x, y, b, X) of class C" 
for values (x, y, b, X) in a neighborhood S of those on E xi . Let g be the pro¬ 
jection of this neighborhood in xy-spacc. For fixed values of (b, X) belonging 
to a set (x, y, 6, X) in S the n-parameter family (20.3) defines a Mayer field 
with slope functions 

Pi(x, y, b, X) = y«(x, a(x, y, b, X), b, X) 

over the region 5 in xy-space. This follows from Lemma 20.1, the Hilbert 
integral J * being independent of the path on the hyperplane x = Xi since on 
this hyperplane we have 

J* = / d(w + biQi), 

as one readily verifies. Moreover, if the region 5 is diminished still further, 
then by virtue of the strengthened condition of Weierstrass for Em the inequality 

Ex[x, y, p(x, y, 6 , X), y'\ > 0 

will hold for every set (x, y, p, X) in S and for every admissible set (x, y, y') y* 
(x, y, p). It follows from Theorem 20.1 that every extremal of the family 

(20.3) with elements (x, y, 6 , X) in 5 and a< = a,(x, y, 6 , X) will afford a proper 
minimum to the integral J relative to admissible arcs in g joining its end points. 

The proof of Lemma 20.3 will be complete if we show that every isoperimetric 
extremal with end points (xi, y.i, x t , y.t) in a neighborhood N of those of Em 
and with multipliers (X) in a neighborhood A of (X) = (0) is an extremal of the 
family (20.3) with elements (x, y, 6 , X) in S and a, = a<(x, y, 6 , X). This result 
follows at once. For, since the end points of Em are not conjugate, every pair 
of points (xi, y0, (x*, y s ) in N can be joined by an extremal E of the family 

(20.3) . If N and A are taken sufficiently small, the elements (x, y, b, X) be¬ 
longing to E will lie in S. The constants a, must accordingly be given by the 
functions a,(x, y, 6 , X), since these solutions are unique. Lemma 20.3 is now 
proved. 
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21. The existence of mi nim al sets. We are now in position to establish the 
following basic result: 

Theorem 21.1. Let s be the sum of the orders of the conjugate points of the 
point 1 on an extremal arc E xi . If E xi satisfies the strengthened conditions of 
Weierstrass and Legendre, and the point 2 is not conjugate to the point 1, then 
there exists a set of s natural isoperimetric conditions 

*A(£a) =0 (a = 1, • • • , s) 

such that E\j affords a proper strong minimum to the integral J relative to neigh¬ 
boring admissible arcs joining its end points and satisfying the conditions (20.1). 
Moreover, no proper subset of these conditions has this property . 

As a first step in the proof of this theorem let 

to = *,<<, < ... < tq </, +l = 

be a set of points on the x-axis such that there are no pairs of conjugate points 
on the sub-arc of E xi defined by the successive points determined by the values 
t, and t r + 1. Let {,« (x) be a set of s functions of class C* such that the equations 

( 2U ) MU, v) = 0 ( a = 1, ... ,s) 

form a proper minimal set of natural isoperimetric conditions for the second 
variation Jib) along E lt . The number of these conditions is equal to the sum 
of the orders of the conjugate points of the point 1 on E xi , by Theorem 5.1. 
Moreover, according to the corollary of Theorem 3.1 we have 


MU, <0 (a, 0 = 1, ... , s ) 

for every set of constants (a) * (0). Let ,u be a set of broken accessorv ex¬ 
tremals having 


(2U) = «*•(« (r - 0,1, ... , g + 1). 

Since each extremal segment of 17,. is a minimizing arc for J t (ri) we have 

(213) lt,)a.at> < 0 . 

The equality holds only in case the as are all zero, since otherwise the functions 
fi.a., eea. would be identical. This, however, is impossible since the first 
of these arcs has no corners. An important consequence of this fact is that 
the determinant 


(2U) I ’ll) - •/>(£„, (,) | 

is different from zero along E Xi . 

As a second step in the proof of Theorem 21.1 let 


(21.5) 


Jt(U = l Sl fadX = /** + Aj*;.) dx = 0 
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be the natural isoperimetric conditions for E i* determined by the functions £<„ 
just described- The Euler equations of the integral J subject to these condi¬ 
tions are the equations (18.3). Let 

(21.6) y< = y%{x, bn, • • • , 6 ,*, A w • • • i A.) (x { £ x £ x t ) 

be a (qn + s)-parameter family of broken isoperimetric extremals having its 
corners on the hyperplanes 

x = t\, x = it, ••• , x - t q 
and containing Bit for the special values 

b„ = b M , X. = 0 (i-1, ...,n;r-l, = 

The parameter 6 ,> can be chosen to be the values of y, on the hyperplane x = t,. 
We have accordingly the equations 

y<(*., b, A) - y»i, y<(L,A) = 6 ,„ y<(x,, b,\) = y it . 

If now we set = y< Xa (x, 6 o, 0), then by differentiating these equations with 
respect to A« it is seen that 

(21.7) u,.( i.) = 0, Ui.(L) = 0, Ui.(it) = 0. 

Upon substituting the family (21.6) in the Euler equations (18.3) and differ¬ 
entiating with respect to A. again we obtain by setting ( 6 , A) - ( 6 0 , 0) the 
equations 

+ $•) - 0. 

It follows that the variations u,. + form a set of s broken accessory ex¬ 
tremals vu = + U. By virtue of the equations (21.7) these extremals 

satisfy the conditions ( 21 . 2 ) and are accordingly identical with the extremals 
Via described in the last paragraph. The determinant 

(21.8) I Mi) | (<*» 0 m !»***» s ) 

can now be shown to be different from zero on the arc E lt . For, if we substi¬ 
tute for un its values va — it is found that 

«/*(£«, Mfl) = •/»(£<>, Vi) — Zi)' 

Moreover, J&., vs) = Uv., v$), as can be seen by the usual integration by 
parts with the help of equations (21.2). The determinant (21.8) is therefore 
equal to the determinant (21.4) and hence must be different from zero, as was 
to be proved. 

As a third step in the proof of Theorem 21.1 we substitute the family ( 21 . 6 ) 
in the functions and obtain a set of s functions X) having 

y,.( 6 0 , 0) = 0. The determinant of the derivatives of these functions with 
respect to the variables X,, , X. at ( 6 , X) = ( 60 , 0) is equal to the deter¬ 

minant (21.8) and is therefore different from zero. The equations J,.(b, X) = 0 
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accordingly have solutions A« = A«(6n, • • • , b gn ) 6f class C" such that X a (6 0 ) = 0. 
When the functions \ a (b) are substituted in the equations (21.6) a new qn- 
parameter family • 

(21.9) y, = yi(x, bn, • • • , b qn ), = A«(6n, • • • , b qn ) (x\ ^ x £ x 2 ) 

of broken extremals is obtained which satisfies the conditions (21.5) and con¬ 
tains Bn for 6„ = 6,x). For this new family the parameters b, r still represent 
the values of y { on the hyperplane x = t,. It follows that the variations 

6y> = !/*»(*, b 0 ) db if (j, r both summed) 

are not all identically zero if the constants db ir are not all zero. Since the 
family (21.9) satisfies the conditions (21.5), it is clear that the variations 
ij% = 6y, must satisfy the conditions (21.1). Hence we have Jt(Sy) > 0 along 
E\ 2 , by Theorem 5.1, since the point 2 is not conjugate to 1. An important 
consequence of this fact is that the functional J takes a proper minimum on 
Ext relative to the family (21.9). To prove this we need only substitute the 
family (21.9) in the integral J and note that for the function J(b) thus obtained 
we have dJ = 0, d*J = Jt(6y) > 0, ( 6y ) ^ (0), at 6,> = b ir o and hence J(b) > J(b 0 ) 
for (6) ^ (bo). 

We are now in position to complete the proof of Theorem 21.1. To do so 
let 3' be a neighborhood of E lt so small that each extremal sub-arc of the family 
(21.9) with ends on successive hyperplanes x = t n j = t r+l affords a strong 
minimum to the integral 

•A = jf ** 1/ + X./.I dz 

relative to admissible arcs in joining its end points. This is possible by 
virtue of Lemma 20.3. Let 3 be a neighborhood of E\\ interior to 5' such that 
every admissible arc C joining the end points cuts the hyperplanes x = i r in 
points (i r , b ir ) whose coordinates b ir determine an extremal E b of the family 
(21.9) lying in 8'. By Lemma 20.3 we have J X (C) £ Jx(B>), the equality hold¬ 
ing only in case C s E b . Suppose now that the arc C satisfies the conditions 
(21.5). We then have J(C) ^ J(E b ), since the arc E h also satisfies these con¬ 
ditions. If necessary, we may diminish the region g still further so that 
J(E b ) ^ J(E xi ), as described in the last paragraph. We then have J(C) > J(E,t) 
unless C s Ext. The proof of Theorem 21.1 is now complete. 

By the use of Taylor's expansion with integral remainder it is found that the 
strengthened condition of Legendre implies the strengthened condition of Weier- 
strass, if we restrict the admissible seta (x, y, y') to lie in a sufficiently small 
neighborhood of those on Ext. Hence we have the following result: 

Corollary 1. If the phrase l, strong minimum" is replaced by “ weak mini¬ 
mum" the conclusions of Theorem 21.1 are still true if the extremal E l2 satisfies 
the strengthened condition of Legendre but not necessarily that of Weierstrass , at 
least if the final end point 2 is not conjugate to the initial point 1 on Ext . 
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We have further 

Corollary 2. If an extremal Bit satisfies the strengthened condition of 
Legendre and the point 2 is not conjugate to \ ,the number of conditions in a minimal 
set of natural isoperimetric conditions for Eh is always the same and is equal to 
the sum of the orders of the conjugate points of the point 1 on E u. 

With the help of the corollary to Theorem 3.1 we obtain the further result: 
Corollary 3. If the strengthened condition of I^egendre holds on an extremal 
arc En whose end points are not conjugate, and if the set (21.5) forms a proper 
minimal set for Eh, the inequality 

<0 (a, 0 - 1 ,•••,«) 

holds on En for every set of constants (a) * (0). 

22. The variable end point case. Consider now the functional 

J *= 9\y(x »), y(*01 + | */(*, v, y ') dx 

J* i 

subject to a set of conditions of the form 

f Jv(*i)» v(**)l = 0 (m - 1* • • • » V £ 2n). 

The values x Xt x t are, of course, assumed to be constants. The functions 
g(y u y t ), ^„(y,, y,) are assumed to be of class C" in a neighborhood of the end 
values of the extremal En in question and the matrix 

II tun II 

is assumed to have rank p. An arc C of class D' with elements (x, y, y') in the 
region R described in §17 will be called an admissible arc for the variable end 
point case if it satisfies the conditions = 0. 

If we set G = g + the transversality conditions for the problems here 
considered are given by the equations 

(22.1) f K (xi) = G mi /»;(**) - “Gw*- 

Along an extremal En satisfying the end conditions = 0 and the transversality 
conditions (22.1) with a set of constants the second variation is expressible in 
the form 

(22.2) Jt(q) = 2?U(*»)> »»(**)] + f* rj, v') dx, 

where 2w is defined as before, and 
2 q = Gv, lV;1 T.i^n + 

To prove this let 

y» = yfa b) (*i ^ x ^ x ,) 
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be a one-parameter of admissible arcs containing E\% for 6 = 0 and having 
suitable continuity properties. Substituting this expression in the functional 
J we obtain a function J(b). This function is expressible in the form 

J(b) = G[y{x Xt 6), y(x 7 , 6), l) + f[x, y(x, b), y\x, 6)) dx, 

where l„ are the constants belonging to E lt . Differentiating twice with respect 
to b and setting 6 = 0, i?» = y*(x t 0 ) one obtains the expression (22.2) by 
the usual integration by parts and by the use of equations (22.1), as one readily 
verifies. 

The equations of variation of the end conditions = 0 on E l7 are the equa¬ 
tions 

(22.3) %(y) = *» a *(*i) + Vi (x 7 ) - 0 . 

It follows that the functional (22.2) subject to the conditions (22.3) determines 
a problem of the type described in §§7-10. If functional J 7 {r,) is non-degen¬ 
erate on E x7 , then the extremal E X t will be said to be non-degeneiate. By the 
order of concavity of Ext will be meant the order of concavity of the functional 
Jt(v) relative to the end conditions. 

Let £ i[x t y, y x , yt] be a set of functions of class C 4 satisfying the conditions 

'b*n$a + ^* m£.i = 0 , 

where 

£<i *= £l*i» y(x i), y{x i), yi(xt)), £,* = £,(xa, y(x t ), y{x i), y(z 2 )). 

A condition of the form 

Mi) - 9, a i<i + 9 w Ja + j {/*£< + f'i\ 1 dx * 0 

J a i 

where £'< =* £,, + £, v * y' k will be called a natural isoperimctric condition. It is 
satisfied by every extremal arc satisfying the conditions - 0 and the trans- 
versality conditions (22.1). 

Let E xi be an extremal arc satisfying the end conditions ^ = 0 and the trans- 
versality conditions (22.1). A set of m natural isoperimetric conditions 

(22.4) JxU.) = 0 (a = 1, • • • , m) 

will be called a minimal set for E X t if the arc E x t affords at least a weak relative 
minimum to the functional J relative to neighboring arcs satisfying the condi¬ 
tions (22.4), and if no proper sub-set of these conditions has this property. 
Such a set will be said to form a proper set for Ext if the determinant 

I Jt (£«, £5) | 

is different from zero on E xl . By an argument similar to that used in §19 it can 
be shown that if the set (22.4) is a proper set for E lit there are infinitely many 
admissible arcs satisfying the conditions (22.4) in every neighborhood of E x7 . 
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Theorem 22.1. Suppose the conditions (22.4) form a proper minimal set for 
the extremal arc Ext. If Ext affords a strong minimum of J relative to neighboring 
admissible arcs satisfying these conditions (22.4), the condition of Weierstrass 
holds on Ext. 

The proof is similar to that of Theorem 19.2. Ut !?<,(*) be a set of (m + p) 
variations such that the determinant 


(22.5) 


Jt ($., riy) 

(y = 1 , • • • , m + p) 


is different from zero on Ext, and set 

(22.6) «, b ) = y*( x » b ) + a t’»«rW 

where y,( x, b) is the family used in equations (19.9). When this family is sub¬ 
stituted in the expressions /»(*.), a set of m + p functions J Xm (a, b), *„(a, b ) 
is obtained having J,.(0, x>) = 0, *„(0, x,) = 0. Moreover, the functional 
determinant of JxM 6), *„(a, b) with respect to a„ • • • , <w p at (a, 6) = (0, x,) 
is the determinant (22.5) and is accordingly different from zero. The 
equations 

7i(a, b) = 0, *,(< 1 , 6) - 0 

therefore have solutions a, = a,(6) of class C' with o 7 (i,) = 0. The famUy 

(22.7) Vi - Vt(*. W + o»(k)w»(*) (*i ^ * S it) 

satisfies the conditions (22.4) and = 0 and contains £„ for 6 = The 
remainder of the proof is now like that of Theorem 19.2 if we replace the families 
(19.9), (19.10) by (22.6), (22.7) respectively. 

Theorem 22.2. If the set (22.4) forms a proper minimal set for E,t, the con¬ 
dition of Legendre holds on E». If E„ is non-singular, the strengthened condi- 
tion of Legendre holds. 

The analogue of Theorem 19.4 is the following 

Theorem 22.3. If the set (22.4) forms a proper minimal set for Ext, the in¬ 
equality JtW £ 0 is true for every set of admissible variations satisfying the con¬ 
dition % = 0, Jt(U v) = 0 along Ext . 

If our end conditions are of the special form 


(22.8) lyfo)l * 0, i,t [y(xj)l = 0, 

the functions used in the definition of natural isopcrimetric conditions can 
be taken in the form [x, y(z)]. The following theorem can then be estab¬ 
lished by arguments similar to those used in the proof of Theorem 22.1. We 
assume that Ext is not tangent to the end manifolds and that the end points 
of Ext are the only pairs of points on Ext satisfying these end conditions. 

Theorem 22.4. Let Ext he a non-degenerate extremal satisfying the end condi¬ 
tions (22.8) and the transversality conditions (22.1). Let r be the order of con- 
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cavity of Em and s the sum of the orders of the conjugate points of 1 on Em between 
l and 2. If Em satisfies the strengthened conditions of Weierstrass and Legendre, 
there exists a proper set of m = r + s natural isoperimetric conditions (22.4), 
such that Eh affords a proper strong minimum for the integral J relative to neigh¬ 
boring admissible arcs satisfying these conditions. Moreover , no proper sub-set 
of these conditions has this property. 

It is interesting to note that Theorem 22.5 holds even if the point 2 is conju¬ 
gate to 1. Our only restriction is that the arc E\t be non-degenerate. 

Corollary 1. The conclusions of Theorem 22.5 are still true if u>e replace 
the phrase “strong minimum" by “weak minimum ” and omit the condition of 
Weierstrass. 

Corollary 2. If the strengthened condition of Legendre holds on a non¬ 
degenerate extremal Em which satisfies the transversality conditions (22.1), the 
number of conditions in a minimal set is always the same and is equal to the order 
of concavity plus the sum of the orders of the conjugate points of the point 1 on Em 
between 1 and 2. 

The analogue of Corollary 3 of Theorem 21.1 is also true, as one readily 
verifies. 

IV 

The general problem in parametric form 

In §§23-27 we shall be concerned with a functional of the form 

J = g[x{tx) t *(/,)) + J* ' f(x, x) dt 

defined on an (n + l)-dimensional Riemannian manifold $R with local coordinates 
of the form (i°, **,-••, x n ). Our end conditions will be taken in the form 

lM*(<»), *(h)l = 0 (p = 1, ... , p S 2 n + 2). 

In our sufficiency theorems we shall restrict ourselves to the separated end point 
case, that is, to the case in which the end conditions are of the form 

^ [*(<«)] = 0, *„[*«,)) = 0. 

In this part we shall show that if a non-degenerate extremal arc Em satisfies 
these end conditions, the associated transversality condition and the usual 
strengthened conditions of Weierstrass and Legendre, there exists a set of m 
natural isoperimetric conditions such that Em affords a strong proper minimum 
relative to neighboring admissible arcs satisfying these isoperimetric conditions 
and the end conditions. Moreover, there is a smallest set of natural isoperi¬ 
metric conditions and the number of conditions in such a set is equal to the 
order of concavity of Em plus the sum of the orders of the conjugate points of 
the point 1 on Em between 1 and 2. It should be noted that this result is true 
even if the end points of E l2 are conjugate to each other, provided that Em is 
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non-degenerate. In the fixed end point case Bn is clearly degenerate if its end 
points are conjugate. Of particular interest is Theorem 26.1, below, which 
gives an extended sufficiency theorem for the fixed end point case. The results 
of this part are obtained without the use of the classical isoperimetric theory. 

In this case we do not need to modify the strengthened condition of Weierstrass 
as was done in the non-parametric case. 

23. Hypotheses and preliminary definitions. The function /(z, x ) is assumed 
to be of class C 4 for local coordinates ( x ) on the Riemannian manifold 91 and 
for all vectors (z) (0). The function /(z, z) is defined so as to be invariant 

under transformations of the form 

y* = z°, z 1 , • • • , z") , y { m ^ z* (t, k = 0, 1, • • • , n), 

where the functions y*(z) are of class C*. We assume also that / is positively 
homogeneous of order one in the variables (z). 

Let W be the product manifold 9? X 91. Its local coordinates 

(*?, ••• ... t x\) 

are composed of the coordinates (z,) and (z,) of its image in 91 and are to be 
transformed accordingly. The functions g(x i, z») and ^(zi, z») are assumed 
to be of class C* on 91*. 

A regular arc is a sensed arc defined locally by equations of the form 

x i « x*(t) (tx £ t £ t>) 

of class C' and having (z) * (0). A neighborhood of a regular arc E xi which 
does not intersect itself can be imbedded in a single admissible coordinate sys¬ 
tem (z) with z° > 0 along E x2 . It will be understood throughout this part that 
such a coordinate system has been chosen. Our results are, of course, inde¬ 
pendent of the coordinate system used. 

By an admissible arc will be meant a continuous sensed arc which is composed 
of a finite number of regular sub-arcs and which satisfies the conditions = 0. 
At times it will be convenient to call an arc admissible even if the particular 
set of end conditions f* = 0 here given are not satisfied. In such a case an 
explicit description of the end conditions used will be given. Unless otherwise 
expressly stated, it will be assumed that the matrix 

\\+*i MW 

has rank p on the arc E x % in question. 

The Euler equations associated with the functional J are the equations 

(23.1) Pi * U - Wdt)Ja = 0 (t = 0, 1, • • • , n). 

A solution of these equations of class C" is called an extremal arc . Such an arc 
will be said to be non-singular in case the matrix 

(23.2) liM* II ft * = 0,1, • • • , n) 
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has rank n. For a non-singular extremal arc the parameter t can be chosen so 
that the functions x*(0 defining E lt are of class C*. 

The transversality conditions associated with the functional J are the equa¬ 
tions 

(23.3) fi*(x h x,) = G,J , fn(xt, ± t ) = - G r j, 

where G = g + l^ m . 

24. The accessory problem. Along an extremal arc E x 2 satisfying the end 
conditions //„ = 0 and the transversality conditions (23.3) the second variation 
is always expressible in the form 

J 1 ( 17 ) * 2tf(i»i, ijt) + J' 2«(ij, i)) dt , 

where 

2^ - G m \ m [ vtvi + 2 G.{.{ + G.j.j 

2 w = i|V + 2 ijty + /i-ii . 

This result follows at once if we consider our problem as a non-parametric 
problem in (ta)-space with t x and 1 3 fixed. The variations rj { (() are assumed to 
be of class D' and to satisfy the conditions 

%(v) - n%) + *> * 0 Gi - 1, • • • , p). 

Such variations will be termed admissible variations. 

The problem determined by the functional J t ( i?) subject to the conditions 
= 0 will be called the accessory problem. It is of the type described in §§7-10. 
We define, as before, a natural isoperimetric condition to be a condition of 
the form 

« 0 ( 1 , n) + / C> Q(| t v)A - 0 , 

J* i 

where 

Qtt, *?) = cifcf + = Q(i, I), 

9) = i + £«* = fl(n, $), 

and the functions ?(t) define admissible variations of class C". We define 
minimal sets, proper sets, type number, etc., as in § 8 . However, Theorem 10.1 
is not applicable in this case since the determinant (23.2) is now identically zero. 
The accessory equations are the equations 

(24.1) LM) = - (d/dl)«* = 0. 

A solution of class C" of these equations will be called an accessory extremal. 
It is well known that if p(t) is of class D\ then 17 *' = pi { is a solution of equa- 
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tions (24.1). Moreover, the functions Li(i?) are not independent since they 
satisfy the conditions = 0. These results are immediate consequences 

of the identities 

±fii = /, XfUik H 0, if HA = frk , 

which hold because of the homogeneity relations f(x, ki ) = kf(x, i) (k > 0). 

In order to establish the analogue of Theorem 10.1, it is convenient to intro¬ 
duce first the notion of special accessory extremals. To do so, let <p(x, x) be a 
function of class C 1 which is different from zero along Em, is positively homo¬ 
geneous of order one in the variables (i), and is such that the equations 

•p t i — {d/dl) •pn =* 0 

hold along Eu. Such a function can be constructed in many ways. For ex¬ 
ample, if f{x, i) is different from zero along Em, we may choose <#>=/» or if W(x) 
is a function of class C 4 such that dW > 0 along Em we can choose <p = W ,< x', 
as one readily verifies. It should be noted that the determinant 


(24.2) 


fnik •pn 

v»i* 0 


is different from zero on Em if and only if the matrix ||/i*r* || has rank n on Em > 
This fact can be established readily with the help of the following well-known 
consequences 


*/**■<>, = 


of the positive homogeneity of the functions / and p. 

We now define a special accessory extremal* to be an accessory extremal which 
satisfies the equations 


(24.3) (d/dt)(pirf) =* + Vitf = a = constant. 

It is easily seen by direct substitution that a special accessory extremal rj‘ is 
of the form if = pi' if and only if p is of the form p = (of + b)/<p, where a, b 
are constants. 

A value f 3 y* h will be said to define a point 3 conjugate to the point 1 on Em if 
there exists a special accessory extremal having if(U) = v'{t a) = 0 and (jj) ^ (0) 
on Id j. By the order of 3 as a conjugate point of 1 will be meant the num¬ 
ber of linearly independent special accessory extremals in a maximal set having 
V(b) = if V*) = 0 and M (°) on be convenient to say that fa is 

conjugate to h if fa defines a point 3 conjugate to 1 on E »*. 

Every special accessory extremal satisfies with X = 0 the equations 

(24.4) Liin) + Vi. = 0, (d*/c/f*)(v»i.V) = 0. 

j» Bliss, Jacobi's condition for problems of the Calculus of Variations in parametric form, 
Trans. Amer. Math. Soc., vol. 17 (1916), pp. 195-206. Hestenes, A note on the Jacobi 
condition for parametric problems in the Calculus of Variations, Bull. Amer. Math. Soc., 
vol. 40 (1934), pp. 297-302. 
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Conversely, every solution i f, X of class C" of these equations has X = 0 since 

■ 0, xv* = v> ^ 0, 

and hence the functions »?* define a special solution. The equations (24.4) 
therefore completely characterize special accessory extremals. 

When the determinant (24.2) is different from zero, the equations (24.4) can 
be solved for the variables i?', X. The solutions have the form X = 0 (since 
z'LiM = 0) and 


It follows that special accessory extremals have properties like those of ordinary 
accessory extremals described in §1. For example, there are 2n -f 2 linearly 
independent special accessory extremals ij'*(Z) (s = 1, • • • , 2n + 2) such that 
every special accessory extremal rj' is expressible linearly with constant coefficients 
in terms of these 2n -f 2 extremals. Moreover, the conjugate points of h arc 
determined by the zeros t» y* t, of the determinant 


v"(0 

V«i) 


(s = 1, • • • , 2n + 2). 


It follows that if Z* is not conjugate to Z>, the points (t h rji) and (Zj, rjt) can be 
joined by a unique special accessory extremal. Similarly, there are n + 1 linearly 
independent accessory extremals n ,k (t) (k = 0, 1, • • • , n) having *j'*(Zi) = 0 and 
such that every accessory extremal tj' having rj'(Z,) = 0 is expressible linearly in 
terms of these extremals and such that the conjugate points of h are deter¬ 
mined by the zeros Zj i, of the determinant | r/ ,k (l) |. 

In this case the condition of Legendre will be said to hold on En if at each ele¬ 
ment (x, x) on En the inequality 


is true for every set of constants (*) * (pi). If the equality sign can be ex¬ 
cluded then the strengthened condition of Legendre is said to hold on En. 

It was seen in §3 that if the type number of J j(i?) is finite, the condition of 
Legendre must hold. Conversely, if we set 

2 w(»j, n) dt, 

Jt\ 

l*(t, v) = f fltt, v) dt, 

J * i 

We have the following result: 

Theorem 24.1. LeZ s be the sum of the orders of the conjugate points of Z l 
between t x and Z 5 . If the strengthened condition of Legendre holds, there exists a 
set of s (and no fewer) natural isoperimetric conditions 


hit; V) = 0 


(a = 1, • • • , S) 
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such that I 2 (v) ^ 0 for every admissible arc ij*(() vanishing at h and l t and satis¬ 
fying these s conditions. The equality holds only in case rj • is expressible in the 
form ij l = m* — px' where u ‘ is a special accessory extremal having u'(t t ) = u‘(/j) 
= 0 and p(t ) is a function of class D' vanishing at h and <*. 

The proof of this result is like that of Theorem 5.1. Lemma 4.1 is true here 
also if the conjugate system tju there used is replaced by a set of linearly inde¬ 
pendent special accessory extremals having rj a (t,) = 0, as one readily veri¬ 
fies. The proof of the fact will be simplified if one notes that one of the ex¬ 
tremals v a (0> say the first, can be chosen to be the special accessory extremal 
n’° = (t — and that the remaining n solutions can be chosen so as to 

satisfy the condition = 0. Moreover, in this case the equation (4.4) takes 
the form 

/.(i) = f* vVdt Z 0 , 

where v { = The equality holds only in case v { = <r(f)x*. From the rela¬ 

tion 

•Pav* * o(t)y> 

we conclude that (f - <i)aj = o(t)<p and hence that the derivatives a[, • • • ,a n 
arc zero. It follows that in this case the functions at, • • • , a„ are all constants 
and a 0 = p{t) vanishes at h and t* and hence that V is expressible in the form 
described in the last statement of the theorem. The remainder of the proof can 
now be made by the arguments like those used in the proof of Theorem 5.1. 

We now return to the functional Jfa) and define the order of concavity of 
Ji(rj) formally as in §7 using special accessory extremals instead of the accessory 
extremals there used. With these definitions in mind we can prove the fol¬ 
lowing: 

Theorem 24.2. Let r be the order of concavity of J 7 (ri) and s the sum of the 
orders of the conjugate points of h between t\ and It. If the strengthened condition 
of Legendre holds, there exists a set of m = r s natural isoperimetric conditions 

(24.5) v) = 0 (a — 1, ••• , m) 

such that the inequality J 2 (ij) ^ 0 holds for every set of admissible variations y' 
satisfying the m conditions, the equality holding only in case r, { is expressible in 
the form rj { = - px‘, where u* is a special accessory extremal satisfying the condi¬ 

tions it = 0 and p(Q is a function of class D' having p(h) = p(< 2 ) = 0. More¬ 
over, the inequality 

<0 (a, 0 = 1, ... , m) 

is true for every set of constants (a) j* (0). 

The proof is like that of Theorem 10.1 with the help of Theorem 24.1. 
Corollary 1. The set (24.5) forms a minimal set for the functional Jib). 
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25. Natural isoperimetric conditions. In this section we assume that E\% is 
an extremal arc (not intersecting itself) satisfying the end conditions = 0 
- and the transversality conditions (23.3). 

Let P(x) be a contravariant vector of class C 4 such that 30 

*„d) = = 0 

on the manifold = 0. The condition 

/id) = M'(*0 + 94/PM + /I/*! 4 +/**<) dt = 0, 

J* i 

where {*' = will be called a natural isoperimetric condition. It is satisfied 
by every extremal arc which satisfies the conditions +, = 0 and the transversal¬ 
ity conditions (23.3), as one readily verifies. 

Consider now a set of m natural isoperimetric conditions 

(25.1) /»(!.) - 0 (a=l, ...,m). 

Such a set will be called a minimal set for the extremal arc E xt if E\% affords at 
least a weak minimum in the class of admissible arcs satisfying these m condi¬ 
tions (25.1), and if no proper subset of these conditions has this property. The 
set (25.1) will be called a proper set for E xi if the determinant 

I /.da, t«) I 

is different from zero on E x %. In this case there exist infinitely many admissible 
arcs satisfying the conditions (24.1) in every neighborhood of Eh, as follows 
from 

Lemma 25.1. If the set (25.1) forms a proper minimal set for E xit for every 
admissible variation n*(t) satisfying the conditions 

/id., v) = 0 (a = 1, • • • , m) 

there exists a one-parameter family of admissible arcs 


x< = x\t, b) 


satisfying the conditions (25.1), containing E\% for b = 0 and having rj* as its 
variations along Eit. The derivatives x\, x\ h , xi ht x\ bb exist and are continuous 
except possibly at a finite number of points on t x t 2 . 

The proof is like that of Lemma 19.1. Let * 4 (t) (7 = 1 , ... t m + p) be a 
set of m + p admissible variations whose determinant 


(25.2) 


/«(«-. *) 
*,(<!») 


ao In order to discuss the general case completely one must choose the functions f of 
the form f(z, x h zj. However, the functions f(x) are sufficiently general for the sep¬ 
arated end point case. We shall accordingly restrict ourselves to this simple case. 
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is different from zero on £|» We substitute the functions 

r{f) + <m;«) + MO 

in equations (25.1) and = 0 and obtain a set of functions /i«(a, b), ^„(a, 6) 
having Ji a (0, 0) = 0. The functional determinant of b), ^„(a, 6) with 

respect to the variables a,, • • • , a m + p at (a, 6) = (0,0) is precisely the determinant 

(25.2) and is accordingly different from zero. The equations Jxa{g, b) = 0, 
^„(a, b) = 0 have solution a, = A y (b) of class C", with A y ( 0) = 0. Moreover, 
if we differentiate the expression JiJtA ( b ), b), we find, as in the proof of Lemma 
19.1, that A ' (0) = 0. It follows that the family 

= j*(0 + A y ( bW y ( i ) + by *( t ) 

has the properties descril>ed in the lemma, as one readily verifies. 

The arc E X t will be said to satisfy the condition of Weierstrass if at each ele¬ 
ment (x, x) on Ei7 the inequality 

E(z, x, y) £ 0 

holds for every non-null set (y) * (kx) (k > 0), where 

(25.3) E(z, x, y) * /(z, y) - y'/*(x, x). 

We have further 

Theorem 25.1. If the set (25.1) forms a proper minimal set for the extremal 
Ext and Ext affords a strong minimum relative to neighboring admissible arcs safts- 
fying these conditions, then Ext satisfies the condition of Weierstrass. 

This result follows at once from Theorem 22.1. For, if we consider our prob¬ 
lem as a non-parametric problem in (fx)-space, the image of Ext must be a 
minimizing arc for this problem and hence we have 

fix, y) - f(z, z) - (y< - iOM*, i) > 0 . 

But this function is precisely the ^-function (25.3) since / = if a. 

The condition of Legendre has been defined in the last section. We now have 
Theorem 25.2. If the set (25.1) forms a proper minimal set for the extremal 
arc Ext, then the condition of Legendre holds on Ext■ If Ext is non-singular, the 
strengthened condition of Legendre holds on Ext. 

This result follows from Theorem 25.1 in the same manner as Theorem 19.3 
follows from Theorem 19.2. 

Theorem 25.3. If the set (25.1) forms a proper minimal set for Ext, the in¬ 
equality Jtiv) ^ 0 holds for every set of admissible variations v l satisfying the 
conditions 

/«({., g) = 0 (a = 1, • • • , m). 

The proof is like that of Theorem 19.4 with the help of Lemma 25.1. 
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26. A fundamental theorem. In this section we shall establish an extended 
sufficiency theorem of apparently new type for the fixed end point case. This 
theorem will be useful in our final sufficiency theorems to be given in the next 
section. 

We begin with the notion of Mayer field. A Mayer field is a region g 0,1 our 
Riemannian manifold 9? together with a set of contravariant vectors p'(x) 
with (p) (0) and of class C' on g such that the Hilbert integral 

I' = if A*, Pto) <k‘ 

is independent of the path in g. The solutions of class C" of the equations 
x* = p'(x) are called extremals of the field. Along such an extremal E we have 
I*(E) = 1(E), where 



The following basic lemma is well-known and follows at once from the inde¬ 
pendence of the path of the Hilbert integral /*. The function E used in this 
lemma is, of course, the Weierstrass ^-function (25.3). 

Lemma 26.1. If E is an extremal of a field g at each point of which the in - 
equality 

(26.1) E(z,p(x),x\ > 0 

holds for every admissible set (x, x) * (x, kp) (k > 0), the inequality 1(C) > 1(E) 
holds for every admissible arc C in g joining the end points of E but not identical 
with E. 

An extremal E xt will be said to satisfy the strengthened condition of Weierstrass 
if at each clement (x, x) on B tt , the inequality 

(26.2) E(x, x, y) > 0 

holds for every non-null set (y) ^ (ki) (k > 0), where E is the Weierstrass E- 
function. The strengthened condition of Legendre has been described in §24. 

Lemma 26.2. If the extremal Ext satisfies the strengthened condition of Weier¬ 
strass and Legendre, the inequality (26.2) holds for every set (x, x) in a neighbor¬ 
hood N of those on E xi and for all values of (y) (ki) (k > 0). 

The proof of this lemma is well-known. 

We have the following analogue of Theorem 20.1: 

Lemma 26.3. If an extremal arc Ext satisfies the strengthened conditions of 
Weierstrass and Legendre and has on it no point conjugate to its initial end point 1, 
there exists a neighborhood g of Ext and neighborhoods Nx, iV* of the end points 
1 and 2, respectively, such that every extremal E in g with ends in N lt N* affords 
a proper minimum of the functional I in the class of admissible arcs in g joining 
its end points. Moreover, every sub-arc of E affords a proper minimum relative 
to admissible arcs in g joining its end points. 

For suppose the coordinate system (x) has been chosen so that x°(0 > 0 
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along Em. We may accordingly choose t = x° as the parameter for E it and 
and also for the neighboring extremals. These extremals will accordingly be 
the extremals for the non-parametric integral 

I = [ F(z, y, y') dx , 

J*\ 

where (x°, z") = (z, • • • , y,) and 

Pfo V, Vl - /(*°, •••,*", 1, x\ ... , z"). 

The Legendre condition for the non-parametric integral is clearly the same as 
that for the parametric integral. Similarly, the conjugate points of 1 are the 
same for the two problems. Hence it follows from the proof of Lemma 20.1 
that there exist neighborhoods 3 of E, t and N tl Nt of the ends of E xi such that 
every extremal in 3 with ends in Nx, Nt is an extremal of a Mayer field defined 
over 3- Moreover, if 3 is taken sufficiently small the condition (26.1) will 
hold in each of these fields, by virtue of Lemma 26.2. The lemma now follows 
from Lemma 26.1. 

We come now to our fundamental theorem. We assume, of course, that 
E\t does not intersect itself. 

Theorem 26.1. Let Em be an extremal arc satisfying the strengthened conditions 
of Weierstrass and Legendre and let Eu be a sub-arc of E\% having on it no point 
conjugate to its initial point 3. There exists a neighborhood 3 of Eu such that E u 
affords a proper minimum of the integral I relative to admissible arcs in 3 joining 
its end points. 

In order to obtain this result, we choose a constant d so small that no sub-arc 
of Eu whose length does not exceed d has on it a pair of conjugate points. 
This can be done by virtue of the fact that the conjugate points of a given 
point are isolated. 

Let P 0 , Pi, • • • , P ff+ i be a set of successive points on E\% such that the length 
of the sub-arc P,P*+ j does not exceed d. We may suppose the point P 0 lies a 
little to the left of the point 1 and P q + X lies a little to the right of the point 2 
and that for some value of k the points P*, P**j coincide with the points 3 and 4 
of the sub-arc Eu in question. Through the points P/ pass analytic manifolds 
jr j cutting Ext orthogonally. From Lemma 26.3 it follows readily that there 
exists a neighborhood 3' of Ext such that not only Eu but also every extremal 
E in 3 # with ends on the manifolds P,_i, P/+ 1 affords a proper minimum of I 
relative to admissible arcs in 3 / joining its end points but not crossing the 
manifolds passing through its end points. Let 3 be a neighborhood of Ext 
interior to 3' such that every pair of points Q/, Q/+i in 3 on the manifolds 
*/, 7r/ + i, respectively, determine an extremal segment E in 3' w *th ends on the 
hyperplanes ir/_i and x/+*. Let E, be the segment of E between Q/ and Q/+ 1 . 
It is clear that Ej affords a proper minimum to the integral I relative to ad¬ 
missible arcs in 3 / joining its end points but not crossing the manifolds 
x#->» ***• 
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With the help of these preliminary constructions we can prove the theorem 
as follows. Let C be an admissible arc in ft joining the end points of the sub- 
arc Eu. It is clear that the curve C crosses the manifolds x,- at most a finite 
number of times. If C does not cross the manifolds x* and x*+j, which pass 
through the end points of E u , then 7(C) > I(E M ) t unless C m Eu, by virtue of 
the remarks made in the last paragraph. Suppose now that C crosses the 
hyperplane x*. Let x A _i be the first manifold on the left which C does not 
cross. As the point P moves along the arc C from the point 3 to 4 it will 
cross the manifold x A at a first point Q k and will subsequently reach the mani¬ 
fold ita+i at a first point Qk+ ». Let C' be the segment of C bounded by the 
Qh and Qk+i and let E be the extremal joining its end points. The arc C is 
not identical with E, since otherwise C could not cross x A at the point Q A . 
Hence we have 7(C') > 1(E) according to the remarks made in the last para¬ 
graph. If now we replace the sub-arc C' of C by E, we obtain a new curve 
Ci with 7(Ci) < 1(C). The segment E of C\ may not lie wholly in ft, but thi9 
fact leads to no difficulties since the arc C\ crosses the manifolds x/ at points 
in 5. It is clear that the arc C\ crosses x A at most r — 2 times, where r de¬ 
notes the number of times the arc C crosses x A . By a repetition of this process 
to the arc C x , and so on, we finally obtain a curve C A which does not cross x A . 
Proceeding in this manner we obtain next a curve C K + 1 which does not cross the 
manifold x A+ i and finally a curve Ct which does not cross the manifold x* 
passing through the point 3 on E M . Clearly 7(C,) < 7(C). In similar man¬ 
ner we may replace the curve Ct by a curve Ct in ft' which joins the end points 
of Eu but does not cross either of the manifolds x A or x A+ j passing through 
the end points of E u . We have accordingly 

7(0 > 7(0) £ I(Eit) 
and the theorem is established. 

Corollary. Every extremal sub-arc in ft' joining points in 5 lying on suc¬ 
cessive manifolds ir it r Hl affords a proper minimum of J relative to admissible 
arcs in ft joining its end points. 

The results described in Theorem 26.1 can be extended readily to the case 
in which Eu has pairs of conjugate points on it provided that its end points 
are not conjugate. This can be done by adjoining a suitably chosen set of 
natural isoperimetric conditions for the arc Eu. The key point in the proof 
lies in the fact that we can choose these natural isoperimetric conditions to be 
identically zero on the left of the manifold x A passing through the point 3 and 
on the right of the manifold x A+ i passing through the point 4. 

27. The existence of m ini m al sets. Consider how a set of natural iso¬ 
perimetric conditions 

(27.1) Ji($ a ) = g a ( Xl , xt) + / *fm(x, i)dt = 0 (a = 1, ... , m). 

1 
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Let F = / -f- \ a f a and Ex{x, x, y ) be the Weierstrass E-function formed for the 
function F. 

We shall need the following lemma which can be established by the usual 
arguments: 

Lemma 27.1. If an extremal arc Eh satisfies the strengthened condition of 
Weierstrass and Legendre, there exists a neighborhood N of the values (x, x, X) 
on En such that the inequality 

£x(z, i, y) > 0 

is true for all values ( x, x, X) in N and every non-null set ( y ) ^ ( hi ) (k > 0). 

Let 

A = [ " (f + dt 
J'i 

and suppose that the arc Em satisfies the strengthened condition of Weierstrass 
and Legendre. With the help of the last lemma it is seen that the analogues of 
Lemmas 20.1 and 26.3 are true also for the functional 7 X with (X) in a neighbor¬ 
hood A of (X) = (0). Suppose now that Em be cut orthogonally by regular 
analytic manifolds ir 0 , r x , • • • , t«+i as described in the proof of Theorem 26.1. 
By an argument like that used in the proof of Theorem 26.1 and its corollary 
one obtains the following 

Lemma 27.2. Under the above hypotheses there exist neighborhoods 3' of Em 
and A of (X) = (0) such that every isoperimetric extremal arc Ex in 3' with multi¬ 
pliers X« in A and joining a pair of points on successive manifolds and tt**., 
affords a proper minimum to lx relative to admissible arcs in 3* joining its end 
points. 

By the order of concavity of Em will be meant the order of concavity of the 
second variation J 2 (v) along Em. The arc Em will be said to be non-degenerate 
if the order of degeneracy of •/*(»?) along E xt is zero. 

In the following theorem we assume that the end conditions = 0 are of the 
form 

tfv, (*(*»)) * 0, WW = 0 (mi ~ 1, • • •, pi £ n; y 2 - 1, • • •, p* £ n). 

We assume further that E XJ is not tangent to these end manifolds and that the 
end points of E x2 are the only pairs of points on Em satisfying the conditions 
= 0. 

Theorem 27.1. Let Em be a non-degenerate extremal arc ( not intersecting 
itself ) satisfying the end conditions = 0 and the transversality conditions (23.3). 
Let r be the order of concavity and s the sum of the orders of the conjugate points 
of the point 1 between 1 and 2. If Em satisfies the strengthened conditions of 
Weierstrass and Legendre, there exists a set of m = r + s {and no fewer) natural 
isoperimetric conditions (27.1) such that Em affords a proper strong minimum to J 
relative to neighboring admissible arcs satisfying the conditions = 0 and (27.1). 
In order to obtain this result, we suppose that the arc En is cut orthogonally 
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in points Pi, • • • ,P q by regular analytic manifolds r,, • - • , as described in 
the proof of Theorem 26.1, replacing the manifolds r 0 , ir* +1 by = 0, = 0, 

respectively. We shall denote the value of the parameter t of E xi at the point 
Ph by h and set t 0 = t\, We next select a minimal set 

MU rj) = 0 (a = 1 , • • • , m) 

for the second variation J t (i») along £„. It is clear from Theorem 24.2 that the 
number of the conditions in this set is precisely equal to the desired sum r + s. 
We may suppose that the functions £’(<) are of class C 4 . 

Let i»«(0 be a set of broken accessory extremals having its comers at t = r* 
such that 

(27.2) vUr k ) = ^( r *) (A = 0, 1, • • • , q + 1). 
If necessary, we can modify the functions so that the inequality 

Mi; vs)a m a e < •/,(£., < 0 

holds for every set of constants (a) * (0). It follows that the determinant 

(27.3) | vs) - Jt(U {*) | 
is different from zero on En. 

Let be a set of m = r + j functions of class C 4 on 9? satisfying the con¬ 
ditions *„(U = 0 on the manifold - 0 and having tf[x(0) = *‘(|) along E lt . 
The isoperimetric conditions obtained by substituting these functions in equa¬ 
tions (27.1) can now be shown to have the properties described in the theorem. 
In order to do so, let the equations of the manifolds jr A be given by 

(27-4) ...,W (Jk-1, ...,*), 

the parameters (b ik ) = (0) defining the point P k on Since the manifolds 
are regular, the determinant | dX*/db ik | is different from zero for (6) - (0) 
and for all A. It is clear also that end conditions = 0 are equivalent to 
conditions of the form 

( 27 - 5 ) ** = **0>h • • • , 6 # ) (s = 1, 2) 

with | zj*(0) | ^ 0 the end points of E x % being given by the values (6,) = (0). 
Let 

(27.6) x' = x% Ai, ••• , 6 # , 6„, • • • , b, n , Xi, — f X m ) (h £ t £ t 2 ) 

be a (a + qn + m)-parameter family of broken isoperimetric extremals for the 
integral 

h = J.' F dx ~+ 

having its corners on the manifolds v kt satisfying the conditions = 0 , and 
containing E x2 for (6, A) = (0, 0). We may suppose that this family has been 
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chosen so that 

b, X) = x u (b ), b, X) = X*(6) (s = 1, 2; h = 1 , ••• ,q) . 

It follows that if we set = (d/dX^x^f, 0, 0), we have 

(27.7) ni(r*) = 0 (h - 0, 1, ...»q + 1) . 

Moreover, if we substitute the functions x*(t, b, X) in the Euler equations 

F,i - (d/dl)F M * = Pi + X.^P* + X.Litt.) = 0 

for 7 X , where P„ L» are defined by equations (23.1), (24.1), and differentiate with 
respect to X„, it is found that for (6, X) = (0, 0) we have 

L.OU + *.) = 0 

along E X t. The arc vl = l' a + li(<) is therefore a broken accessory extremal 
satisfying the conditions (27.2) by virtue of equations (27.7). It follows that the 
functions differ from those described in the last paragraph by at most px‘ with 
p(*a) = 0 (h = 0, 1, • • • , q + 1) and hence can be taken as identical. By the 
use of equations (27.2) and the usual integration by parts it is found that 

•/*({•! Vt) = Jii 1 ); Vt) 

and hence that 

1 $) = MU vt) — •/*($•, £p) 

= Jt Ou, np) — J*(U £p) • 

The determinant 

(27.8) I MU lit) I 

is therefore equal to the determinant (27.4) and hence must be different from 

^ When the family (27.6) is substituted in the functions Jx(U a set of m func¬ 
tions Jxa{b, X) is obtained having J,.(0, 0) = 0. Moreover, the functional 
determinant of Ji.(6, X) with respect to X,, • • • , X* at (6, X) = (0, 0) is pre¬ 
cisely the determinant (27.8), and is accordingly different from zero., The 
equations «/,«(&, X) = 0 therefore have solutions X. = X.(6) of class C " with 
X„(0) = 0. The family 

(27.9) x* = x'{t, b) = x'\t, 6, X(6)], X„ = X«(6) (*i £ t ^ <j), 

of broken isoperimetric extremals satisfies the equations (27.1) and +> = 0 and 
contains Ext for (6) = (0). Renumber the b’ 8, 6„ ••• , b ,« + „. The variations 

7 ? = 6x i = x'6 fit, 0 )dbj O' = 1» • • • t W + m ) 

accordingly satisfy the conditions Jt(Za, v) = 0 an ^ Hence 

Jttfx) ^ 0. Since Ext is non-degenerate, the equality holds only in case ox' is 
of the form px‘, where P (f.) = p(ft) = 0. But this is impossible unless p ■ 0 
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by virtue of the fact that the arc E xi is not tangent to the manifolds r h . It fol¬ 
lows that when the functions z { (t, b) are substituted in the functional J the 
function J (6) so obtained satisfies the condition 7(6) > 7(0) for (6) (0) and 

sufficiently near (6) = (0) since dJ = 0, d*J = J r {hx) > 0 at ( b) = (0) for 
(db) * (0). 

We are now in position to complete the proof of Theorem 27.1. Let ft' be 
a neighborhood of E x2 and A a neighborhood of (X) = (0) related to E lt as in 
Lemma 27.2. Let 5 be a neighborhood of E x 5 interior to ft' such that every 
set of q + 1 points in ft, one on each of the manifolds (27.4), (27.5), determines 
a set of parameters (6) such that 7(6) > 7(0), (6) * (0), and such that the 
broken extremal of the family (27.9) determined by (6) is in ft' and X«(6) is 
in A. Consider now an admissible C in ft and let Q k be one of the points in 
which the curve C cuts the manifold x*. The points Q h and the end values 
of En determine a set of parameters (6), which in turn determines a broken 
isoperimetric extremal E b of the family (27.9). Since the arc E b passes through 
the points Qs on C and joins the end points of the arc C, we have 

h(C) > h{E b ), 

unless C ■ E b , the multipliers X. being those belonging to E b . Since the arcs C 
and E b satisfy the conditions (27.1) and have the same end values, it foUows 
from the last relation and the above remarks that 

J(C) £ J(E b ) - 7(6) £ 7(0) - J(Eit), 

the equality holding only in case C a E xt , as was to be proved. 

It is clear that none of the conditions (27.1) can be dropped, since otherwise 
Jiita, v) = 0 could not be a minimal set for J t (ij ) along E x7 . The set (27.1) 
accordingly forms a minimal set. From the above proof it follows readily 
that the set (27.1) forms a minimal set for E xt if and only if the equations of 
variations 7 a (f a , tj) « 0 form a minimal set for J 3 (r,). Hence we have the 
following corollary. 

Corollary. If a non-degenerate extremal arc E xi satisfies the strengthened con¬ 
ditions of Legendre, then the number of conditions in a minimal set is the same 
for every such set. 

28. More general natural isoperimetric conditions. In the fixed end point 
case and in the case of closed extremals it is possible to define natural iso¬ 
perimetric conditions of a very general type. In the fixed end point case 
we select a function H(x, x) of class C 4 such that 

(28.1) H(x, kx) = H{x, x) (* > 0) , H(z h x) . H{n, y) 

for all non-null values of (x) and (y), where (*,) and (*,) are the coordinates of 
the fixed points 1 and 2. Let «*, x) be a solution of the equations 
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The condition 

(28.2) K(i) = !‘‘ |/,? + (H+ - I dt = 0 

is a general natural isoperimetric condition. It is clear that this condition is 
satisfied by every extremal joining the points 1 and 2, since along such an arc 
we have 


K(() - f‘‘ \dH/dt + i'lfj - (d/dt)fji\\ rft = 0. 


If f{x , x) > 0 we can always choose the functions so that // = $‘( x , i)f i i, at 
least in the non-singular case. The condition (28.2) can then be written in the 
simpler form 


| * - 0 , 


as one readily verifies. The natural isoperimetric conditions described in §25 
are of the special type in which the function H is of the form // = (*( x)f #. 

It is clear that by the use of the general isoperimetric condition (28.2) mini¬ 
mal sets can be defined, as above, for a non-singular extremal arc. More¬ 
over, the equation of variation of the equation (28.2) is expressible along 
in the form 

Jt& ») - 0 , . 

where Jj(£, v) the usual bilinear form for the second variation Jj(v), and 
= (‘(ar((), x(()|. It is understood, of course, that V(*0 = n'(fj) = 0. From 
this fact it follows readily that the theorems described in §§25 and 27 are still 
true in the fixed end point case if we replace the natural isoperimetric conditions 
there used by the more general conditions there defined. We have accordingly 
the following result. 

Theorem 28.1. If the strengthened condition of Legendre holds on a non¬ 
degenerate extremal , the number of conditions in a minimal set of general iso¬ 
perimetric conditions is always the same , and is equal to the sum of the orders of 
the conjugate points of the point 1 on Eit. 

In the case of closed extremals general isoperimetric conditions can be defined 
as above by omitting the second condition (28.1). 

In the study of geodesics on a closed convex surface H. Poincar6 showed that 
there is a shortest closed curve C on S which satisfies the condition 


(28.3) / / Kdo = 2x 

and that C is a geodesic. Here K denotes the Gaussian curvature of S and the 
integral is taken over one of the regions bounded by C. This condition can be 
transformed into one of the type here described. In fact, if we use isothermal 
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coordinates (x, y) (x being an angle of period 2so that the element of arc 
becomes 

ds* = X*(dx’ + dy>), 

condition (28.3) can be shown to be equivalent to a condition of the form 

L 1 ^ - X-IfOA — z'] dl mz 0. 

This is a natural isoperimetric condition defined by the function 

H = arc tan (y'/x') - x . 

V 

The case of multiple integrals 

Many of the results described in the preceding pages can be extended at 
once to multiple integrals. A complete theory for multiple integrals is not to 
be expected since such a theory does not exist even for the classical minimum. 
However, by the use of natural isoperimetric conditions one can go a long way 
towards a complete classification of extremals which do not satisfy the Jacobi 
condition. To illustrate this theory we shall use the non-parametric double 
integral problem.’ 1 


29. The double integral problem. Let A be an open region in the xy-plane 
bounded by a simply closed continuous arc C. Let 5 be a surface defined over 
A + C by an equation of the form 

(29- 1 ) ^ - *(*, y) |(x, y) on A + C ]. 

In the following pages we shall be concerned with the classification of the ex¬ 
tremals of an integral of the form 


J 



y, z, p, q)dxdy t 


where p = dz/dx , q = dz/dy. 

The function /(x, y, z, p, q) is assumed to be of class C 4 in a region 9f of 
points (x, y, z, p, q). We shall suppose that the boundary C of A is composed 
of a finite number of regular sub-arcs. A surface S will be called a regular 
surface if the arc C bounding A in the xy-plane is regular and the function 
z(x, y) ^ of class C 1 , or if it is composed of a finite number of partial surfaces 
having these properties. 

By an admissible surface will be meant a regular surface, all of whose elements 
(*, y> P, Q) are in % and which passes through a fixed curve L. The curve L 


" In this P art free ^ has been made of the unpublished lectures on multiple integrals 
given by Bliss at the University of Chicago, summer, 1933. 
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is assumed to be a simply closed continuous curve composed of a finite number 
of regular sub-arcs, and having the boundary C of A as its projection in the 
ary-plane. 

The Euler equations of the integral J are the equations 

/. - O/ax)/, - O/dy)/, = 0. 

A solution (29.1) of these equations of class C" with elements (x, y, z, p, q) in 9? 
will be called an extremal surface. 


30. The accessory problem. Along an extremal surface S the second varia¬ 
tion of J is expressible in the form 





2«(x, y, v, Vw) dx, 


where 


2w - f„ rf + 2f ip w, + 2/„ nv t + /„ + 2 w, + /„ . 

A variation rj(x, y ) will be termed admissible if it is a regular surface vanishing 
on the boundary Col A. The Euler equations for this integral are the equations 

L ( ri ) = U>, - (d/dx)w„ - (d/dy)w„ = 0 , 

and will be called the accessory equations. A solution of class C" of these equa¬ 
tions will be called an accessory extremal. 

The quadratic functional J t (i)) has associated with it a bilinear functional 

Mi ,») - j j i) dz , 

A 

where 


n) =* fa + + fan* • 

If the function {(x, y) is of class C" and vanishes on the boundary C of A, the 
condition 


Mi, *) = o 

will be called a natural isoperimetric condition for the functional It is 

satisfied by every accessory extremal »j(x, y), since by the use of Green's theorem 
it is found that in this case 

Mi, v) = J c ifa. dy - dx) = 0. 

Conversely, a variation v(x, y) of class C" which satisfies all natural iso- 
perimetric conditions is an extremal surface. This follows from the funda¬ 
mental lemma, since now we have 
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Jt(Z, v) = JJ Mv)Z dx dy + £ $( w „ dy — dx) = J J L(n)Z dx dy = 


0 


for every set of admissible variations ( of class C". 
Consider now a set of natural isoperimetric conditions 


(30.1) Jt(Uv) = 0 (0=1 , 

The Euler equations of J t subject to the conditions (30.1) are the equations 

L(v + usZe) = 0, = o. 

A solution v, M of these equations of class C" will be called an isoperimetric 
extremal. 

The set of conditions (30.1) will be called a proper set of natural isoperimetric 
conditions if the determinant 

I Zs) I (a, 0 = 1, ••• , m) 

is different from zero. 

Lemma 30.1. If the set (30.1) is a proper set of natural isoperimetric conditions, 
every isoperimetric extremal ij { , satisfying these conditions has p, = 0. 

For, by the use of Green’s theorem, and from the fact that ( a = 0 on the 
boundary C of A, it is found that 

HU l) = jjiMv) dxdy - If (Mi,)dxdy . = 0, 

A A 

and hence that = 0, as was to be proved. 

The set of conditions (30.1) will be called a minimal set for /*(*) if the in¬ 
equality J t (rj) £ 0 is true for every admissible variation t?(i, y) satisfying the 
conditions (30.1), and if no proper sub-set of these conditions has this property. 

Theorem 30.1. If the set (30.1) is a minimal set, there exists a proper minimal 
set composed of the same number of natural isoperimetric conditions. 

This result can be proved by the methods in §3 used in the fixed end point 
case for simple integrals. In a similar manner we obtain further results. 
Theorem 30.2. If the set (30.1) forms a proper minimal set, the inequality 

(30.2) $4)a a a tf <0 (a, 0 = 1, • • • , m) 

is true for every set of constants (a) * (0). Conversely, if the set (30.1) forms 
a maximal set of natural isoperimetric conditions such that the conditions (30.2) 
hold, the set (30.1) forms a minimal set. 

Theorem 30.3. The number of conditions in a minimal set of natural iso¬ 
perimetric conditions is the same for every such set. 

In view of the last theorem we may define the type number of the functional 
Jt(v) to be equal to the number of isoperimetric conditions in a minimal set, if 
such a set exists. If there is no minimal set, the type number of J 2 (v) will 
be said to be infinite. 


193 



280 


G. D. BIRKHOFF AND M. R. HESTENES 


Theorem 30.4. If the set »?.(x, y) (a = 1, • •• , m) forms a set of m ad¬ 
missible variations satisfying the conditions 

(30.3) * 7 i)a a a 3 <0 (a, 0 = 1 , • • • , m) 

for every set of constants (a) j* (0), the type number of J s is at least m. If these 
functions form a maximal set of such variations, the type number of J, is equal to m. 

This follows at once from Theorem 30.2, since the functions ij a (x, y) can be 
replaced by admissible variations £«(x, y) of class C" without disturbing the 
inequality (30.3). 

The condition of Legendre on a surface S will be said to hold if the inequality 

Q(x, y, 0 ) = f pp cos* 0 + 2 f„ cos 0 sin d + /„ sin* 0 £ 0 

is true on S for every constant 0. If the inequality sign is excluded, the strength¬ 
ened condition of Legendre will be said to be satisfied on S. 

Theorem 30.5. If the type number of is finite, the condition of Legendre 
holds on the extremal S. Moreover, if the expression f PP f qq — f* pq is different from 
zero on S, the strengthened condition of Legendre holds on S. 

To prove this let m be the type number of J 1 ( 17 ), and suppose the theorem 
is false. Then at some point (io, yo ) and for a particular value 0 O we could 
have Q(x 0 , yo, 0o) < 0. In fact, there would exist m + 1 points P a ■ (z«, y a ) 
(a = 0 , 1 , • • • , m) such that 

Q(x a , y„, 0o) < 0 (a = 0, 1 , , m). 

With each of these points P a as a center we could describe a set of m + 1 
circles h„ in A which do not intersect. By the methods used by Mason” we 
could construct a set of m + 1 admissible variations »j.(x, y) having 7 j a ■ 0 
outside the circle h a and having J(va) < 0. It would follow that 

1?«) < 0, T ) s ) = 0 (a 0; a, 0 s 0, 1, ”' , m ), 

and hence that the type number of Jt(ri) would be at least m + 1, by Theorem 
30.4. From this contradiction we infer the truth of Theorem 30.5. 

A continuous curve C composed of a finite number of regular sub-arcs will 
be called a conjugate curve if there exists an accessory extremal q(x, y) having 
i) s 0 on C but not both rj x a 0, ij t m 0 on C. 

We have the following interesting theorem concerning these curves. 
Theorem 30.6. Suppose the strengthened condition of Legendre holds on the 
extremal surface S. If the type number m of the functional «/*(» 7 ) on S is finite, 
the number of curves in any set of non-intersecting conjugate curves cannot exceed m. 

To prove this let C« (a = 1, • • • , p) be a set of non-intersecting conjugate 
curves and rj a (x, y) be the corresponding accessory extremals having v„ = 0 
on C a and ( rj a x, ^ (0, 0) on C a ■ Let P a = (x c , y a ) be a point on C a at 

which (i 7 „, rjay ) * (0, 0). With P a as a center, describe a circle h a which does 

” See Bolza, Vorlesungen uber Varialiorurechnung, pp. 673-5. 
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not intersect the curves Cg(fi a). Let u a (x,-y) be an admissible variation 
which is identically zero except on the circle h a , and is such that 

(30.4) / uJw'.Gl.) dy - «*(*.) dx] * 0 . 

Jc a 

Such a function surely exists, since otherwise the expression 

Wish*) dy - "*(’)•) dx = (fppija, +/,*!?«*) dy — (/py rjax -f fnlmv) dx 

would be identically zero, by virtue of the fundamental lemma in the calculus 
of variations. Moreover, on C« we have 


dx 4- i) ay dy = 0, 


since rj a = 0 on C a . But these relations and the Legendre condition would imply 
that (vat, lay) = (0, 0) at P a , which is not the case. The functions u a (x, y) 
accordingly can be chosen as described. 

As a next step in the proof of Theorem 30.6 we set ij a = ij a within C« and 
identically zero elsewhere. By the use of Green’s theorem it is seen that 



V.) dx dy 


= / *J"«»(i|«) dy - «*(*.) dx) 
J c a 


0, 


where A a denotes the region bounded by C.. If now we set = i) a + b a u a 
(a not summed), it is found that 

Jt(ta) = Jt(%) + 2b a J t (f, a , u a ) + b*J t (u m ) 

= 26 a 7j(fj -f u.) 4 - blJi(u m ) (a not summed). 

By the use of Green’s lemma again one sees that u a ) is equal to the ex¬ 
pression (30.4). We may accordingly choose 6. such that ,/,({„) < 0. We 
shall now prove that */,({., {,) = 0 (a * /?). This result is clearly true if the 
curve C ff does not lie in the region A 0 bounded by C«. If Cg is within A a we 
have 


•/*(£«, ifl) — f}o) + hfiJitia, Ug). 

By the usual integration by parts it is seen that 

ns) ■* J j «(£«, V g) dx dy 


j j vsHU) dxdy + £ iiJavft.) dy - «*($.) dx] = 
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Similarly, we have Jvs) = 0, and hence Ml*, $,) = 0 in this case also. 
We have accordingly 

Jtea, ia) < 0 , Jt(U U « 0 fi;a, 0 ml,... ,p). 

It follows from Theorem 30.4 that p cannot exceed the type number m of Ji(n), 
as was to be proved. 

It is clear that this last result can be extended to sets of conjugate curves 
which may have points or sub-arcs in common. 


31. The existence of mi n i m al sets. We shall now show that under certain 
conditions, which are known to be satisfied in many cases, the type number of 
the functional J 1 ( 17 ) is finite. We assume that the boundary C of A is regular, 
and that the strengthened condition of Legendre holds. 

The accessory boundary value problem is given by the system 

(31.1) L(u) = au, u s 0 on C. 

A value of a for which this system has a solution (u) & (0) of class C" on A 
will be called a characteristic root, and the corresponding function u{x, y) will 
be called a characteristic solution. 

Suppose now that there exists a sequence of characteristic roots 

(31.2) a, £ Z ... £ a** ... 


with lim <y„ = 00 and a sequence of corresponding characteristic solutions 

Ml, tit, • • • , ti„ • • • 

such that every function ij(x, y) of class C" with 17 ■ 0 on C can be expressed 
by a uniformly convergent series in the form 

(31.3) t)(x, y) = fc*u,, 

k-l 


where the c’s are constants. We suppose further that 

(31.4) J J UiU k dx dy = 6 ik [&« = 1, $i* — 0 (t ^ k) ]. 

A 

An immediate consequence of this assumption is that 


(31.5) 


c k 



Iju k dx dy, 


as one readily verifies by multiplying the equation (31.3) by u, and integrating 
term by term. 

Theorem 31.1. Let m be the number of negative characteristic roots in the 
sequence (31.2). If we set = u„ (a = !,•••, m), the inequality 


(31.6) 


Ji(U hjOaO,3 <0 (a, 0 = 1, • • • , m) 
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holds for every set of constants (a) * (0). Moreover, J t ( v ) ^ 0 for every ad¬ 
missible variation jj(z, y) having ij = 0 on C and satisfying the conditions 

(31.7) (a = 1, • • • , m). 
To prove this we first note that by integration by parts we have 

(31.8) */*(£«> n) = JJ vL(ta) dx dy = o a J J dx dy , 

A A 


since ( a = u a . It follows from equations (31.4) that 


J(t., {.) - a. < 0, J,(!„ {,) = 0, 


and hence that the condition (31.6) is satisfied. 

Consider now a surface ij(x, y) of class C" having v = 0 on C and satisfying 
the conditions (31.7). Such a surface is expressible in the form (31.3). The 
first m constants in this expression are all zero, since 


c a 


j j tii.dxdy = (1 /».)/({., _ 0 

A 


(a not summed). 


If now we multiply the series (31.3) by L(r,) and integrate, we find that 
[ j ijI'M dx dy * ^ c k [[u k L(n) dx dy . 

A *—+1 \ J 


From this expression we see by integration by parts and the use of equations 
(31.1) and (31.5) that 

J(l) - £ *kC k £ 0 . 

k-m+l 

The equality holds only in the case 


V - Cm+iU M + ••• + Cm+'Um+t, 

where *«+1 = • • • = *.»+. = 0, <r *+«+1 > 0. This proves the theorem for sur¬ 
faces i) of class C". 

In order to show that the theorem holds for all admissible surfaces vanishing 
on C, we note that every such surface *j(x, y) is a limit of a sequence of ad¬ 
missible surfaces Vk (x, y) of class C". Let n fik be a set of constants satisfying 
the conditions 

<r„Hak = Vk). 

From this equation we see that lim ti ek = 0 if r, satisfies the conditions (31.7). 
It follows that the sequence ij k - Z 0 n Bk satisfies the conditions (31.7), has 

Jt(vk - Swek) ^ 0 (A: = 1,2,--.), 

and has v (x, y) as its limit. Hence J 2 (rt) ^ 0, as was to be proved. 
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Corollary. The set (31.7) forms a minimal set, and the type number of Ji(rj) 
is equal to m. 

32. Natural isoperimetric conditions in the general case. We shall now re¬ 
turn to the functional J described in §29. If £(x, y) is an admissible variation 
of class C ", the condition 

MO = jf If/. + U. + f./.l dxdy = 0 

A 

will be called a natural isoperimetric condition. This equation is satisfied by 
every extremal S, as can be seen by the use of Green’s theorem. Conversely, 
every admissible surface of class C" which satisfies all natural isoperimetric 
conditions is necessarily an extremal surface. This result is readily obtained 
by the use of Green’s theorem and the fundamental lemma for double integrals 
in the manner described in §30. 

Consider now a set of m natural isoperimetric conditions 

(32.1) Ji(*#)-0 1, ...,«). 

Such a set will be called a minimal set for an extremal S if S affords a weak 
minimum to the integral J relative to neighboring admissible surfaces satis¬ 
fying the conditions (32.1), and if no proper sub-set of these conditions has 
this property. The conditions (32.1) will be said to form a proper set of natural 
isoperimetric conditions for S if the determinant 

(32.2) | Ml., *,) | 

is different from zero on 5. In this case there are infinitely many admissible 
surfaces satisfying the condition (32.1), as can be seen from the proof of the 
following theorem. 

Theorem 32.1. If the set (32.1) forms a proper minimal set for an extremal 
surface S, £ 0 on S for every set of admissible variations v satisfying on S 
the conditions 

(32.3) Jt(S; v) = 0 (a - 1 , • • • , m). 

To prove this let z(x, y) be the function defining the surface S, and t?(x, y) 
be an arbitrary admissible variation satisfying the conditions (32.3). When 
the function 

(32.4) z(x, y) + a a £.(x, y) + br,(x, y) 

is substituted for z in /,($«), a set of m functions Ji«(a, 6 ) is obtained, for 
which J i„(0, 0) = 0. Moreover, the functional determinant of Ji«(a, b) with 
respect to the variables a b • • • , a m at (a, 6) = (0, 0) is the determinant (32.2), 
and is accordingly different from zero. It follows that the equations 
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Jia(a, b) = 0 have solutions a a = A„(b) of class C' with A„(0) = 0. By differ¬ 
entiating the identity 

y,«U(6), b) 3 o 

with respect to 6 it is found that for 6 = 0 

0 = MU 0 ) + MU v) = MU MAIM, 

and hence that A ^(0) = 0. It is clear that the one-parameter family of ad¬ 
missible surfaces 

*(r, y, *>) = z(x, y ) + A a (b)U*. y) + bij(x, y) 

satisfies the equations (32.1), contains the extremal S for b = 0, and has 
2 b{i, y, 0) = tj(j, y). It follows that the function J{b) obtained by substi¬ 
tuting this family in the functional J satisfies the relation ./(/>) £ 7(0), since 
the set (32.1) forms a minimal set for S. We have, accordingly, J'{0 ) = 0, 
7"(0) = 7a(«?) ^ 0, as was to be proved. 

Theorem 32.2. If the set (32.1) forms a proper minimal set for an extremal S, 
the condition of Legendre holds on S. 

This result follows at once from Theorems 32.1 and 30.5. 

A surface S will be said to satisfy the condition of Weierstrass if at each ele¬ 
ment (x, y, z, z t , Zy ) on it the inequality 

E (*> y, z, 2 m, Zy, p,q) £ 0 
is true for every admissible set (x, y, z, p, q), where 

E " !{I - V’ *• P • 1) - Hz, y, z, z„ z.) - (p - z.)/,Cr, y, t , ,,) 

“ (V - y, 2, 2 m, Zy). 

Theorem 32.3. Suppose the set (32.1) forms a proper minimal set for an 
extremal surface S. If S affords a strong minimum relative to neighboring ad¬ 
missible surfaces satisfying the conditions (32.1), the condition of Weierstrass j holds 
on o. 


In the case that S forms a minimum in the classical sense the theorem is 
usually proved by showing that if the Weierstrass condition fails to hold at 

U wellTfi f’ 'l' e l C * XB,S a ° ne ‘ paran "' ,pr of surfaces v (x, y, b ) which 
>s »c" defined for b g 0, contams S for b = 0, and is such that J'(U) < 0 for 

the function J(b) obtained by substituting this family in the integral J.» We 
shall use this fact to establish the more general theorem here given 

J/,Z like ,hat UScd in ,he ' )roof of Theorem 32.1 it is readilv 

the family oUdmiible surf^"'' ^ ^ C ' tor b - 0 such ,hat 


z(z, y, b ) = :(x, y ) + A.(b)(.(i, y) 

u Bliss, loc. cit. 
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is well defined for 6^0, satisfies the conditions (32.1), and contains S for 
6 = 0. If now we substitute the functions z + M » for z in the integral J , a 
function J(a, 6) is obtained such that 

J[A{b), 6] £ J(0, 0) 

for 6 ^ 0 and sufficiently near zero. It follows that we must have 

M(0, 0)A' a (0) + M0 t 0) £ 0. 

The values J aa (0, 0) are the values of the functions J i(£«) on S and hence are 
zero. Hence we have M0, 0) £ 0. But J k (0, 0) is equal to the derivative 
J'(0) of the function J(b) described in the last paragraph and is accordingly 
negative. From this contradiction we infer the truth of Theorem 32.3. 

Harvard Univirsitt. 


200 
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SINGULAR POINTS OF ORDINARY LINEAR 
DIFFERENTIAL EQUATIONS* 

BY 


Let 


( 1 ) 


GEORGE D. BIRKHOFF 


2 ‘-£•.<*>*, 




be a system of n ordinary linear differential equations of the first order in which 
we have for | a; | > R 

( 2 ) + + + + u.jmi, i, ■■■, »), 


q being an integer. We propose to investigate the nature of the solutions of 
( 1 ) in the vicinity of x » ao, which is taken to be a singular point of the given 
system of equations so that q ^ - 1 . The integer q + 1 is termed the rank of 
(1) at a; = oo, after PoiNCARft.f It will be further assumed that the n roots 


<*,» <*., •••, a m 

of the characteristic equation 

(3) K-$<,«|-o 

[where B (J = 0 (* 4 =j), and 8 . = 1 ] are distinct, and in the case q = — 1 do not 
differ by integers. The writer hopes to discuss the general case later. 

It is apparent that any proper linear transformation 

in which the functions \(r) are analytic or have a pole at x = oo, will trans¬ 
form ( 1 ) into a system of equations (I) with coefficients « w (*)of the same form 
(2), although q is not necessarily equal to q. Any such system of equations (I) 
will be said to be equivalent to ( 1 ) at r = oc. 

The first part of the paper deals with the determination of the simplest sys¬ 
tem of differential equations which is equivalent to (1) at x= oc. In § 1 an 
impor tant Lem ma on Analytic Functions is stated and proved. The applica- 

* ,>resented *> the Society (Chicago), April 17. 1908, under a different title, 
t Aota Mathematics, vol. 8 (1886), p. 305. 
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tion of this lemma in § 2 shows that there always exists a canonical system of 
equations equivalent to (1) at x = x having the form 



•/=! 




in which P 0 (x) are polynomials. In §3 it is shown that the rank of this 
canonical system of equations at r=oc can be taken to be 7 so that the degree 
of the polynomials P, ; {x) docs not exceed 7 + 1 . 

The significance of these results is best seen by considering the case of a reg¬ 
ular singular point 7 = — 1 . The canonical system of equations equivalent to 
( 1 ) is then in simplest form (§ 3 ) 


•IV, xr 

<lY t „ 

*d * = 

dr. v 

■'dr-*- 1 '- 

and a fundamental system of solutions is 



r-.o. 

v: =0, 

r-=o. 

r: =x~-. .... 

e 

0 

II 

• • • • 

r,-=o. 

• ••••• 

IV- 0. .... 

IV-*-. 


Now the solutions of ( 1 ) and the above system of equations are connected by a 
relation 


*)*, 

j-i 


i = 1,2,•••,«), 


in which the X v (r) are analytic at r = x (or have a pole, but this case is 
easily excluded). It follows that there exists a fundamental system of solutions 


of (1). 'Jt\ 'Ji 


»!/f (j-1,2, •••,«).of the form 


y/'-LMaW-M*)*’’ (<•.;=1,2, 


i»i 


This is the fundamental theorem for a regular singular point. The solutions 
of the canonical system of equations equivalent to (1) at x = oc play the same 
rule in the case of an irregular singular point (7 ^ 0 ) that they <lo in the case 
above considered. 

The nature of the solutions of the canonical system of equations can be dis¬ 
cussed by special and elegant methods when 7 = 0. In the case 7 = 0 it is 
known that the solutions of any system (i) with rational coefficients can 

be expressed by a Laplace integral 

y.= fe*ij.(z)dz (i=l, 2, •••, n), 
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and that from this formula can be deduced the behavior of the solutions in the 
vicinity of o- = oo. This method of treatment is due to PoiNCARfc,* and has 
been developed more fully by HoRX.f It is applicable in an extremely simple 
form to canonical systems of equations for which q — 0. If q > 0, generalized 
Laplace integrals of a simple character will be shown to be available. The 
integrals for q = 0 are considered in §4. 

These integrals are employed in §5 to find the asymptotic nature of the solu¬ 
tions of the canonical system of equations equivalent to (1) at x = oo in certain 
sectors of the a-plane. The asymptotic form of the solutions of (1) in the same 
sectors follows as an immediate consequence. The powerful and general method 
of successive approximation which Horn* has employed in various cases will no 
doubt lead to the same results. 

A Kiemannian characterization of the solutions of the canonical system of 
equations in the vicinity of x = oo is given and extended in § 6 to the solutions 
of (1). The number of characteristic constants in the solutions is found to be 
the same as the number of arbitrary constants of the canonical system of 
equations. 

'l'lie characterization suggests the probable generalization of the Kiemannian 
problem when the singular points of the system of differential equations are not 
regular. This formulation and a count of constants is given in § 7. 

§1. Auxiliary lemma. 

In order to obtain the simplest possible system (I) equivalent to (1), we can 
choose as we please the n’ functions V of the transformation (4) but with the 
restriction that they are analytic or have poles at x = oo and that the detenni- 
" a "t I \/l * 8 not identically zero. How can one choose the functions \ so as to 
free as far as possible the solutions of (I) from singularities in the finite plane? 
The answer to this question, which will be fundamental in o.ir study of (1), is 
derived from the following lemma. 

Lemma on analytic functions. Let 6 . (x) be a set of functions ofx , 
single-valued and analytic for |x| > II but not necessarily analytic at x = =c, 
and also such that (he determinant 


!*,(*)! =M 


(|x| > R). 

Then it is possible to so choose n* multipliers \(x) % analytic at x = oo, that 

t (or ± <L(x)X.,(x) = x-{,,(*)) 

—- . (•'■* = 1 , 2 , •■,»), 

AinericaD Journal of Mathematics, vol 7 nn on** ot;a * » «• 

a' ’ ,,h , 0rd " ^ tr “ , " i iD ' hiS ‘‘ , " i 0,h " to later" 

I Matheinatische Annalen, vol. 50(1897), pp. 525 - 556 . 

JCrelle’s Journal, vol. 133 (1907), pp. 19-67. 
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where m is a fixed positive integer or zero , and the {,*(*) are entire functions 
of x for which the determinant |{,*(0)1 + 0* 

Proof. The two alternative forms of the Lemma are clearly equivalent. 
We shall confine ourselves to the pi-oof of the second of these, which is the 
easier to give in matrix notation. 

In order to elucidate the lemma let us consider first the case n = 1. We 
have to determine X„ so that 

^ii \i = x "fir 

Since 0 t fx) 4 = 0 for \x\ > R % the function log d xx is expansible in a Laurent 
series for |x-| > R excei>t for a term a log x, 

♦ m 

log = £ r,*' + a log x. 

rm—m 

Take then 

X„ = e , m = a, {„= e"“ , 

if a < 0, or 

—■ +• 

_ 2 v 

X„ =» e — 1 , m«0, I,,-**®”* s 

if a 2 0, and we have the desired solution. This explicit treatment is not pos¬ 
sible if n > 1, in which case we shall have to construct a solution by an infinite 
process. 

As it is convenient to use matrix notation, let 0 denote the matrix \\$ if \\ and 
©-» its inverse matrix ||0J|| which is of the same form, since |0J + 0 for 

|x|>i?. Also we shall write A = || Xj|. 

Furthermore, if <f> is single-valued and analytic in the vicinity of x - oo , so 
that it is expansible in a Laurent series, 

*= L 

rm—m 

we write 

?(*) = £«/, -AW=E a .*’. 

r>« **-• rM “" 

Here m is some positive integer whose value will be specified later. We have 
the obvious relations 

</> = P(4>) + N(4>) = />(*) + + ■•• + + A’.W. 

P [iV(^)] =/’[JV.(^)] = JV[P(*)] = i^.[P(^)] = 0. 

More generally, if we have a matrix 0 a ||$J, we shall use the notation 

P(0) S ||P(^)||, ^(•)-||JT(^)||, iV.(0) s ||A.(^ ; )||. 

Trans. Am. Math. Soc. 29 
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Let now T= || ty || be any matrix of constants of determinant not zero. 
Consider the matrix equations 

(5) r = P( 0 - , 'j') * = Tx-~+ AT m (®.r). 

If a solution of these equations T » || 7 j|, V = || exists , we mil have at 

once the desired matrix 

( 6 ) A = N( 0 _l .- 'F). 

To verify this statement we notice first that the functions \ are analytic at 
x = oo by ( 6 ), since they are convergent series of negative powers of x. More¬ 
over from the first equation of (5) we obtain 

(7) T -f W(0 _l ■¥)« 0 -* 1 • 

From this equation (7) we derive 

( 8 ) 0-T-f 0[iV(0-«.'!')]=* 

on multiplying by the matrix 0 . Furthermore, by the second equation of (6), 
we obtain also 

(9) V— 7lr.~ m + 0-T -P(0 T)-J, 

in which the elements of Ii are polynomial of degree m — 1 at most in 1/x. 
If we add ( 8 ) and (9), we find by means of ( 6 ) 

0 A = Tx— -P(e.T)- Ii. 

The right hand member forms a matrix j|*— f„(x)|| in which the f (») are 
entire functions of x with the desired property 

lf«(°)l-l*J + o. 

We have thus proved our statement. 

It remains to construct a solution of equations (5) for some m. This we 
do by a method of successive approximation. 

Write 

r o =»p( 0 -'.y o) , 

(10) v 1 -a- + jvr.(e-r #)l 

r, = p ( 0 -' -v,), 

^- + iv,(© r,>, 

**•••• 

If the sequence of matrices V , r, tend to limit matrices, it will not be difficult 
to prove that these limits form the desired solutions. 
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Define the difference matrices 


r=r-r.,. r„ = r„. 



These matrices for p > 0 satisfy the equations 

(11) VA'. ( er rl ), r =P(0-*.^). 

It follows from the first of these equations that S' contains only negative powers 
of x. Hence the positive powers which come from the multiplication of (-)’ 1 and 
'P in the second of these equations arise from the positive |>owers in the ele¬ 
ments of (") -1 and from the negative in so that 

( 12 ) 

Likewise we find 

(13) A’.(»T,. l )-.v„[iV.(e)T^ 1 ], 

since the elements of contain only |>ositive power* of x and a constant term. 
Thus ( 11 ) can be written 

( 14 ) * =;V [.V.(6).f r.-i>[P(6-')•¥,]. 

In this form we shall prove the convergence of 


r-r, + r, + —, 

for large enough m . 

Choose R B so that 

r 0 > R 9 > R ; 

then we can choose M so large that* 


( 15 ) 


*(*;>«-j; 


-v(i + r ' + r* + 





since P( 0 ~ l ) is an entire function of x. Moreover since N{ 0 { j) is analytic for 
|x| > R, the series for jV(^), 



•'The notation, fl , means that the coefficient* of the various powers of r in the power 
series a do not exceed the corresponding terms of ,1 in absolute value. I shall extend this nota¬ 
tion of Poincare to matrices A and B of which corresponding terms ay and are power 6eries. 
Thus the notation A « B implies that ay ^ for all values of i and j. 


% 
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is convergent for |ar| > 7?, so that 


-“ 0 


Hence for large enough m we have \Q£\ j) > m, no matter how small 

t be taken. Thus we can choose m so large that for any small e 


*•<'.>«,' -‘(x , + 5 + "> 


Take m so large that 


n~eMr 9 It 


1 ' o-.-x.Y" 1 - 

It is to be noticed that both the comparison series used converge for 
r 0 >|x|>/7 0 . 

Now choose L so large that 

a 8 ) r„=r.«:;-^-_|!. 


First by (16) and (18) we see that 

^, = ^v.[iv„(©).rj 


i-- i 


«W N ( ntLr *R. _\ „ r «z,r„y? / i i \-i 

<JI_ 1 . 

Next, by (15) and the above inequality, we see that 

r,=/>[/’(©->) ^.l« l| p[,- •^•XLrjR. _1 Ij *UMLr*B, j 

ii L (r-X.){r.-*)(*-lt,) J « | j j^,)^ 

as before. And thus in general 


n i ''-'€’M*-'Lr> 0 R' I 


!(>•.--«o )"-■(*-RJ ' r<< ,l(r a -B.)’'(r 0 -x) ,' 

as may be proved by induction. If (17) holds, we shall have convergent sene, 

for 7j/ and ^ if r,>!x|>^ 0 . 

Furthermore, from the equations (10) we infer that 

_ r = P( 0 -'.qr) > 

•The compound matrix bos n terms in each element j hence we get » multiplier n. 
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so that must be an entire function of x. Then likewise we infer that 

v = ri- + ^(0T). 

Thus ll^ll and ||*„|| satisfy (5> 


§ 2. Existence of an equivalent canonical system. 

We are now in a position to answer the question which was raised at the 
beginning of § 1. 

By the hypothesis concerning (2) there are no finite singularities of the func¬ 
tions a.fix) of (1) exterior to the circle |x| = R. Let x describe a closed path 
exterior to this circle about x = co. It is easy to show that there will exist u 
linearly independent particular solutions 

yV\ •••, tf', 


y;\ •••. }/:\ 

which undergo the substitution 


(19) . 

ip-mp. fr-wt*. 

unless two or more values of p are equal, when degenerate cases may occur.* 
We will suppose for the present that there exists a fundamental system of 
this character (19) for which, of course, 


W'\ 




Define now functions 6 fix) by the n* equations 





2w V 


7ri log 4 


It may be inferred at once that the d. } are single valued and analytic for 
| x | > R and that 


IWI 




Thus the O.fix) fulfill the restrictions of the Lemma. Hence we can deter¬ 
mine a set of functions X t> (x) analytic at x = oo in such a manner that 


the f lt (x) being entire functions of x and |f (1 (0)| 4 = 0* 

Mt is here simply a matter of the canonical form of a linear substitution. 
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The transformation (4) changes (1) into a new system of equations (I) with a 
fundamental system of solutions 

f o ( x ) {ifjs= 

Therefore we can transform a system (1) into an equivalent system whose 
solutions have no other singularity in the finite x-plane except x = Q, and this 
singularity will he of the above simple form. 

The new system of equations can be written 

d Hi - 

y < y. 

ir a" ••• v." 


( 20 ) 


i 

W 


(tar' 

ir v 


T/«> 


9: 


= 0 (f = l, 2, •••.«). 


Hence we see that the coefficients u^x) are quotients of entire functions, and 
therefore ineromorphic in the finite plane with poles only at joints for which 

On the other hand, by a fundamental property of any transformation (4), the 
new system of equations (20) is of the same form as (1) so that the S u (z) are 
analytic at x = oo, or have a pole at that point.* 

These two properties of the functions show that they are rational functions 
of x. Thus the new system of equations equivalent to (1) at x = oo ca/i be 
written 


( 21 ) 


d £-t,R^)y r 


in which the coefficient s J?„(x) are rational. Furthermore it possesses a funda- 
mental system of solutions 

<* 8 > *$«=•*-*,(*), 

where the ?„(x) are entire functions of x and |f (0)1 * 0, and the poles of 
•«(,(*) he at points for which 

A = lt//>| = 0. 


•Write the transformation inverse to (4) in the form 


*• 


zz'Z ba,e a po,e at ,ut p ° int - a 
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By a succession of transformations (4) we shall now remove all the singular 
|K)ints of (21), not at x=0 ami x=oc. These can only he apparently 
singular points because of the character of the fundamental system of solu¬ 
tions (22). 

Since from (21) we have 


_ I*,,(«)+*£">+ ...4* am U)W* 


it is clear that the entire function A is the product of a factor which never van¬ 
ishes, and of factors of the form (r — r')\ X l>cing a positive integer. The 
singular points will lie at .»• = /. These last factors can be removed as follows. 
Choose constants c, , c., • • •, c , not all zero, so that for x = r 

(23) u i.«.•••.») 

This is possible because by hy|>othesis A = 0 for x. = x . Let c k be one of the 
quantities c t which is not zero. Make the further transformation of the form (4) 

Di-Vi (•' + *), 

- Wi + <‘ i y* + + 

!tk — .. TP • 


The fundamental system of solutions y'*of (21) is transformed into another 
such system of the same character, 

sv'-y/* «+*>. 

™ X — X 


Accordingly the new system of equations has again the form (21). But we 
have 


so that a factor x — x has been removed from A. 

In this way one can remove in succession each factor x — x\ and finally reach 
the case when A does not vauish except perhaps for x = 0. Thus a system of 
equations (21) exists whose solutions have the additional property 

(24) |j?’| + 0 (x + 0). 

It follows that the coefficients R,j(x) cannot have any poles except at x = 0 
and * = oo, and so are either polynomials or polynomials multiplied by a posi¬ 
tive integral power of 1/x. The index of this power does not exceed 1. For 
suppose we have 

•»„(*) =(i)’t ‘u+ *>+"■} »). 
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in which r> 1 and not every t.. is zero. Substituting i/p = x l '~ f y (») hi (21) 
and equating coefficients of lowest powers of x, we obtain 

0 = (*.*■=!, *, •••,*)• 

By taking i = 1, 2,. • •, n successively it is inferred that t.. = 0 (ij = 1, 2, • ■ •, n) 
since (^(0)1 + 0. This is contrary to hypothesis. Therefore we must have 




where P tJ (x) denotes some polynomial in x. 

THEOREM. There always exists a canonical system of equations equivalent 
to (1) at x = oc, 


(25) 


</* = L p v (*)y, 


(f= 1,2, 


•»»*). 


in which the P if (x) are polynomials. 

The proof of this theorem has been based on the assumption that there exist 
n linearly independent solutions of (1) of the character (19). The extension 
of the above proof to the degenerate cases is illustrated by taking a degenerate 
case for which Then there exist n linearly independent solutions 
such that we have in place of (19) 


SV'-ftri", 

• • • • 


•••. Hi”. 



••••••• 

77 = p..y;'+i/r 

'77 - P.-,77+'7r", 

77 ~ P-,’7:' + !/r" 


In this event we may write 

y‘f = .A $ v (x)t kj = log P J (| = 1. 2, • • •, n ; j= I, 2, • • •, « —1) 


[' 9 *<*> + ivv l j-- 0 *..-.(*) log*] < < = 1. 2, • ■ ■. ») 

* nd " ,e functions e ,M) tluis define,1 will be single-valued and analytic for 

1*1 >U- 

Furthermore we have 


| 0..(x) | = 

Thus the functions O^x) satisfy the restrictions of the lemma of § 1, and a set 
of multipliers \..(x) will be determined. If now the transformation (4) be 
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made, the new system of equations in y. will have a fundamental system of 
solutions 

<»*= 1,2. •••»**; J — 1,2, • • •, n — 1), 

ft" = X*-- £ U*) + 2rp y_ ! ?-..-,(*)] (' = ». 2. •. »), 

in which f v (x) is an entire function and |{„(0) | + 0. The rest of the argu¬ 
ment in the degenerate case is very like that employed before. 

§ 3. Formal solutions anti the polynomials P.j(x). 

It can be shown easily by means of the formal solutions of (1)* that the poly¬ 
nomials P tJ (x) cannot all be of degree less than q + 1. 

Let us first consider these formal solutions in the case q = 0. If we substi¬ 
tute in (1) formal expressions 

<i = l,2, 

A { (x) = A. -f d , i l> £ + •••» 

and attempt to determine a, /9, ■ *, A i% A { \\ ••• so that the equations 

resulting are formally satisfied, and not all the n quantities A. are zero, we 
shall find that a is one of the n roots of the characteristic equation (3) and that 
we can then determine 

A tt 0,A\'> t A'**', 

in succession ; thus there arise precisely n formal solutions of ( 1 ) 

9,-8,”, •••. . .. 

where 

S'" = A,,(x) (•■.>= l. 2 . ••,»), 

( 26 ) p A x ) = a >^r\ + ^ +• - + V' 

^*( x ) = A tj + A'lj- H- 

The first set of equations determining A u are 

(27) A v a j= (f,i=l t 2,--,n). 

* SciILESlNOER, Vorletungen iibrr Unrare Difftrentialgleichungen, pp. 176-182. For a discai- 
sion of regular singular points see pp. 122-171. 
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It is necessary for us to know also that the formal determinant 


I £<>.( _ trv)*-*™#,....*.. | D + + ... J 

does not vanish identically. As a matter of fact, the constant D = | A x) | is not 
zero. Otherwise we could find multipliers m,, m tt . •m w , not all zero, such 
that 

m 

'E'Aymj — 0 (f.1,3,"•,»). 

Multiplying equations (27) by m.(j =1,2, • • •, n) and adding, we obtain 

■ 

Z A^ntj ~ ° (< = i,a,•■,*). 

A second set of multipliers is m] = a m y ; likewise a third set is mj = a)m r 
Thus we obtain n sets of multipliers m Jt a t m j , • - •, a?-' m.. It is clear then 
that we have 

m 

Z 4 «,«, + — + - 0 


for all values of c # * c,, •• •, Since a i% .. a m are distinct we conclude 

that 

A iJ m J = 0. 

Hence for any value of j for which m, 4 = 0 we have 


which is impossible. 

Let us turn now to consider the formal solutions of any system (I), equivalent 
to (1) at x = co by means of a transformation (4). The formal solutions of (I) 
are defined by the equations 




k-1 


= 2 , •.-,«) 


Hence S contains the same exponential factor as S“\ although Je may differ 
from Kj by an integer. The formal determinant |$y>| is also not identically 
zero, since we have 


The same results are obtained for the case q =- 1 except that the exponential 
multipliers e p ‘ w are lacking in the formal expressions Sf and S‘‘K 

We will now prove for the case q £ 0 that the rank of any system of equa- 
t.ons (1) equivalent to (1) at *= » is at least q+ 1. Otherwise we must 
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lmvc equations like (27) (•(['. being replaced by fi'J) to determine the constants 
A tj of the formal solutions of (f). in which however 


These equations are 


<7„ = 0 


Vi-® 


U,j -1,2, H 


(',>=1.2, •••, h), 


whence we see that A SJ = 0 (i — 1,2, •••. n) for any a. not zero. This is 
impossible. 

In particular, then, the rank of the canonical system (25) equivalent to (1) at 
a* = oc is at least 7 + 1. 1 he /ydynniuialx /V ( x) cannot all he of degree h ss 
than q -f 1. 

We shall now prove that the thy ret of the. polynomials P,j(x) can he token 
not to exceed q -f 1 . 

Let us denote the n formal solutions of (25) by Tf\ Tj\ • T ' 
O’ " 1 * 2, • • •, i») where 


(28) 




Af —/ y being an integer. Let t t be the index of the highest power of l/x 
which all the P,.(x.) (j = 1,2. •• •, n) contain as a factor for fixed i. Then 
we may write 

C. / (s)-C. + C:;l+-... 

By a succession of transformations (4) of such a character as not to change 
the form of f25), we now proceed, if necessary, so to transform (25) that 

(29) |C,| + 0. 

Assume the indices 1 to n so chosen that 

— t m . 

If \C,j\ = 0 , oiie can choose constants c,, c t% • • •, c h not all zero, such that 

c,C,i+',C,j+ ■■•c.C. / = 0 0 = 1 , 2 ,...,.). 

Let c # be tlie last one of this set which is not zero. The transformation of the 
form (4), 

y.-y. (*+•)» 

!i. = jf, + -+ • • - + c.y., 
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leaves (25) unaltered in form. The new formal expressions Tp are the same 
as 7V ) for i 4= s. Also 

= c, 7Y* + c 7 S--''T'' -f ... -f cP 


± r , c v + ±+ 0=1.2, 

1 /e« Is* * ] 


The terms 0'« 1,2,.n) are all zero by hypothesis. The for¬ 

mal expressions 7\" (j *= 1 , 2 , .»*) will contain another factor l/x, and we 
have increased t t by unity or more. 

But we have 

l ? 7 , l = c.|r , '>| 

directly from the transformation. In this way we can increase t a , 
without altering, save by a constant factor, the formal determinant 
Since the determinant contains 1 /x at least to the power 

/, + f,+ ••■ + /„ + <. + t,+--.+i m , 

we see that this process of transformation must finally come to an end. This 
occurs when (29) is satisfied. 

When (25) has been transformed so that also (29) holds we may prove at 
once that the degree d j of />, can not exceed ? + 1 + t, - t. . In fact, if some 
of the integers 

are positive, let t be the greatest of them. Also denote the coefficients of the 
term 

in P v (x) by p r Substitute the formal solutions in (25). When coefficients 
of highest powers of a are equated to zero one obtains 




(f, * = 1 , 2 , 


which is not possible in view of (29;. 

Je note at this point that the degree of />,.(*) does not exceed o + I if 

i -y. We will now make further transformations so that this will also be true 
when t > j. 

Let us first attempt to remove the constant terms of P r (x) (j= 1 2 •. • n—1\ 
by a transformation ** ’ * 


Vi « Vi (* + »). 


IJn C mnVn + +-H C |t| y, , 
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( x )~ e .,PA x ) 

( «. J+"), 


(«■+». 

(30) **<■) = t C.; PA*) ~ t % P (*)'■> 

(*+«>. 

J = l j= 1 

II 


•?..(*) = 

J al 



It is to be noted that the degree of P fJ (x) (i, j =» 1, 2, • • •, n) does not exceed 
q + l + t. — t If y/"' denotes the constant term of P.. (x) we have to satisfy 
the conditions 

Z (* = 1, 2, • • •, n -1). 

i m \ j-i 

Or if we write 

£«•,#> X - Xc - 

J m * 

these conditions may be stated in the form 


(■I 

This gives the equations for X, 

Now each transformation employed since we obtained (21) has not affected 
the form (22) of the solutions. Hence (25) still possesses a fundamental system 
of solutions of this form. A direct substitution of these solutions in (25) will 
show that the roots X are the quantities 

k x — m y • ••, k m — m. 

The roots X are therefore distinct and do not differ by integers in the case 
in which (1) has n solutions of the form (19) where p, y p 2 , are distinct, 

inasmuch as 

k. = - —logp i (; = 1 . 2 , •••,«). 

; 2*V-1 brj 


We confine ourselves to this simplest case. 

Choose for X any root k m — m , determining a set of values ( j = 1 , 2 , • • •, n) 
in which c„ 0. Such a root exists, since otherwise the linear equations 




)=• 




= 2 , •••, n 1) 


could be satisfied for n values of X, and there are only n - 1 such values. 
The corresponding transformation (c m =1) will remove the desired terras. 
Also we have = k m — m in the new system of equations. 
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(31) 


/y*)-w 

I J „(x)-= P„(x) -cx'P u (x) 


Next we remove the constant terms of P,-i.j( x ) O' — 1* ^^ ^ 

a transformation 

y.-y, (* + »-!). y._, = y.-. + f .-i , 3,1 • 

This can always be accomplished as before and we shall have reduced the con¬ 
stant term of y>_,. „_,(*) to -.... Continuing thus, we finally remove all 

the constant terms of P^x) (i <j) leaving 

- t, - m , • • -, Ji* = k.-m. 

When the constant terms are removed, we apply a transformation 

Di = Vi (* + “)» y. = y. + cx 'y> 

to remove any term ;."ja’ of P.,(x) («> 0)ot de S ree T less than _ ‘ A 

direct computation shows that we have 

(* + «). 

/>.,(*) - P.s (*)- «' [ **..(') - (*) + • 

It is clear that the degree of /„(*) (i,j = 1,2, n) is not greater than 
q + l + t t — tj since r < — f*. 

Let us take t = 1. All the new terms in P fJ (x)(i <j) are at least of first 
degree, since a multiplier x appears throughout in them. Furthermore, since 
P (a (x)(i>a) and P B) (x)(0>j) contain no constant terms after the previous 
transformation, it is clear that there are no new terms of first degree except 
possibly in 

p„(*) (*'<«). A»(*) (0<i)- £*(*)• 

Also the terms of the first degree in P, 0 (x) can be made zero by a proper 
choice of c, namely such that 

i< 8 -«(*.- 

This shows that we may remove the terms of first degree in P {J (x) — 1), 

by taking them in proper order, namely with increasing i and decreasing j. 

In the same way we may continue, taking r= 2, 3, • until all terms of 
P..(x) (i>j) of degree less than t. — t j are removed. We may therefore 
write finally 

p»(«)-*-*«,<*> 

where Q 0 (x) is a polynomial for i >j. This holds likewise for i = j, since then 
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t t —tj2S0. Also Q,j(x) will be of degree q + 1 or less, since P v (x) is of 
degree q + 1 + t t — or less. 

The transformation 

y i = xf'y i (< = i, 2 , •••, n) 

will give us a system of equations 

X dx~%^ X ^‘' 

in which 


P„(x) = #-*P ¥ = Q tJ (x) (i 4= j), P«(*) = P,(x) + *, -«„(*) + 

Thus there exists a canonical system (25), equivalent to (1) at x = oo, in which 
the degree of /V (x) does not exceed q + 1. 

We still have at our disposal a linear transformation with constant coefficients, 




In view of the nature of the roots of the characteristic equation (3) we may 
simplify P t) (x) by this means so that the equivalent canonical system of equa¬ 
tions becomes* 





( 32 ) + <■+;>• 

p u (x) - +rii* + • • + *,**'• 

The result which we have obtained is that there exists a canonical system 
equations of the form (32) equivalent to (1) at x = oo. This form will 
hereafter assumed. 

In the case q = — 1 the canonical system reduces to 


dY. 

dx 




X *L 

X dx 


r. 


But this system of equations is equivalent to (1). Hence, as stated in the intro¬ 
duction, there exists a fundamental system of solutions of (1) 

y/*=X |> (x)x-S tf’-M*) 3 *’ U=1.2,--,n), 

where \.(x) are analytic at x = oo or have a pole at that point. As there are 
precisely n formal solutions, 

y t -*A v [x), —, y.~ x* A„(x), 0 = 1,2, »), 

M*) = A il + Ayl + -, 

• For ao analogous redaction in the case 5 = 0 see Schlksinorb, loo. oit, pp. 183, 184. 
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we infer that \ f ,(x) = A..(x) and is therefore analytic at x = oc. This is 
the fundamental result for a regular singular point and is due to Salvage. 

§ 4. Generalized Laplace integrals. 

The problem of the paper now resolves itself into a study of the solutions of 
(32) for 0 = 0. 

Let us consider first the case q = 0. The equations (32) are then 

*5 - + a ' x)Y ' +p " r > + ■ ■ • + p '- v -' 


(33) 


•5 - * R r * + 1*8 + v ) 1 ". + ■ • • + *8 r - 


• = Jftr, + rf? r, + - + (/>•::+ 


The formal solutions will have the form (T'(\ T^\ • • •» T'^) (j = 1,2, • • •, n), 
where 

TV'-e** *«,(*). 

Direct substitution shows that 

In this system of linear differential equations write 

(34) r, (O* 

By employing the formula 

je tl xu(z)dz = [e"tt(*)]-JV*c/u(z), 

in which the path of integration will be subsequently so chosen that the 
bracketed terra may be omitted, we obtain, by equating the coefficients of e" in 
the integrands on both sides of (33), 

— + 1 K+rf. l, >.+ -• +/0.. 


(./=!, 2, •••,») 


(1 = 1 , 2 , 


(35) 


-(*-«,) 5 , -ri5N. + (riS+ ih,+ -+*. 


- (»■- oj=+ (p)3i. + ■ ■ ■ + +1 H 


*Annales de l’Eoole Normale, series 3, vol. 3(1886), p. 392. The analogous result 
for a siDgle equation of the nth order is due to Fuchs. 
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We shall show how to employ (35), thus formally deduced, to obtain an actual 
representation (34) of the solutions. 

The system (35 ) has « + 1 regular singular points at z=3,, 
and z = oc , and no other singular )>oiuts. The exponents at .»• = 'i ) are all zero 
except one which has the value — y/"’ — 1 = — — 1 . These results show that 

we may choose a fundamental system of n solutions, of which »t — l are regular 
at x = a, and one has the form 








VJ'=(Z 


in which X s Pi 2 — <*,) > s regular at z = . This is true except possibly when 

l. is an integer. We will refer to this possibility later. 

The formula (34) will represent a solution of the given canonical system of 
equations provided we write 

rj x = rj t = 1 ij\ • • •, V, — 

and the corresponding contour c j of integration is taken to be a “Zoo// circuit" 
that is, a circuit extending along a (properly chosen) ray from z = x to the 
vicinity of * - a , and then around the point in a negative sense, returning 
finally to z = oc along the ray. The direction of the ray must be so chosen 
that it docs not contain another singular point z = and that 11 (rz) (the 
real part of arz) is negative along the ray. AN hen this is done there is no diffi¬ 
culty in proving that each set of formulas 

yj) = JV(z - a.)-*- 1 X'/\z - a, )</z (f« 1 , 8, - 

represents a solution of (33), at least in certain sectors of the plane. 

In the case in which — /. — 1 is a positive integer the loop circuit is replaced 
by a straight line path from z = 0 to z = co. When — Z y — 1 is negative or 
zero, there will be a solution which contains a logarithm. We choose this solu¬ 
tion to replace i//\ •••, */”• In all cases we obtain a formula for each 

singular point. 

This elegant method of attack (for a single linear differential equation of the 
nth order with rational coefficients) was given by PoiNCARfc in a fundamental 
paj>er.* The method has been more fully developed by HORN,f who shows that 
a direct study of the solutions can be made by these formulas in the complete 
vicinity of x — x. 

When q > 0 the above method fails. PoiNCARf: has indicated a method of 

•American Journal of Mathematics, vol. 7 (1885), pp. 217-232. 

f Mathematieche Annalen, vol. 50 (1897), pp. 525-556. 

Tram. Am. Molb. Soc. 30 
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quadratures reducing the case 7 > 0 to the case 7 = 0 , but the application 
of this method seems impracticable. * 

It is the aim of this paragraph to show that, at least in the case before us, 
formulas exist for 7 > 0 , analogous to those given when 7 = 0 .f 

First let us consider the case 7 = 1 . The equations (32) are in this case 

7 J r 

* d ; - (Ki+ p\> + +(rf!+*«*;+ • • • + Ofl+rfM r -' 

X d -yJ - 04?+ p» r, + (PS++ v?) r ,+• • - + (rf: + ;/„■» K, 


(36) dx 


~ ( 7' L ‘ +i>L'^ ) r ! +. • -+ 0 C+/O+ 

The formal solutions are (7V\ 7Y\ •••, TV’) (j = 1,2, •••,«), where 


*,(*)-** + *5* + - 

If these formal expressions be substituted in (36), we obtain 
(37) 


0 = 1,2, •••,».). 


(« = 1 , 2 , 


We shall take as the generalized form of (34) in this case 

(38) J\ = J*** [*•(*)+ **n(*)]* 

Now substitute this expression (38) in (36), integrating by parts according to 
the formula 

J e*"z?ti(s)<fc = [e** , ?i(a)] — J e ti, du(z) % 

in order to reduce the terms which contain x* and x 2 to terms in x' and x° re¬ 
spectively. If the bracketed terms are omitted and the coefficients of e*' : and xc* 1 * 
in the integrands are then equated, there result the equations: 

- (2* - a/j;- = -1,, + t MT*,, + (• = '- 

-(2s - afy + = 2, 10 + E^,. «=>. 


(39) 


J=l 


•Horn, Acta Matbematica, vol. 23 (1900), pp. 177-201. 

fCf. Cunningham, Proceedings of the London Mathematical Society, ser. 2, 
vol. 4 (1906), pp. 374-383. 
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Thus we reach a system of 2 n linear equations of the first order in z to deter¬ 
mine the 2?i functions 


’hi* *?:i* •••» ’J.i* ’ho* '"* ’U* 

This system of equations is analogous to (35). It has clearly singular points 
at z = a,/2, z = aJ2 y • • •, z = oJ2 and z = oc , the last of which is a regular 
singular point. The finite singular jtoints arc not in general regular , although a 
slight consideration proves them to l>e of rank q = 0 at most. 

It is possible however to make a given one of these singular points z = aj 2 
a regular point by first making in (36) the simple preliminary transformation 


The new system of equations (36), ami hence also (39) will be the same as 
before except that //'/ is replaced by fi" t — £, (»— 1.2, • • •* n ). These n sys¬ 
tems of equations (36), in general distinct and obtained by taking »•= 1,2, • • •, ?i, 
replace the single system (35) of the case q = 0. 

The rth system of equations (39) has a regular singular point at z = aj 2. 
For the system (39) may be written 


(f-1,2, 


S 0 = 2 ^- 

(i=1.2, • 


A sufficient condition that z = aj 2 be regular is that there is no pole at z = a/2 
of the second or higher order in the coefficients of •••, »? B ,» »?,<>» 
in the right-hand members. Clearly such a pole cannot exist except in the 
second n of these equations when i = r, and then only if /4'V + But 
in (39) fi',1 is replaced by p') — £ r , which vanishes by (37). Hence the state¬ 
ment is proved. 

The exponents of the regular singular point z = a t j 2 can be verified directly 
to be all 0 except two, which are 

_/ r _l 2 

2 —’ 2 * 


Hence there exists a solution V,7» •••* V.7* Vi?* ** ** Cl °* * orra 



where ^3(* “ <*./ 2 ) an(1 ???(* - a rl 2 ) are analytic at z = a r /2. Each set 
of formulas 
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will then represent a solution provided that c , is a loop circuit to z aj/Z so 
chosen that R[t?(z - a,./ 2 )] is negative along the ray. 

jf (_/ — 1 )/2 is an integer, we can obtain solutions corresponding to the 

other exponent 2 )/2 which will not be an integer. 

The proof of the validity of the above formulas is exactly like that in the 

case 9 = 0 . 

Let us now consider the general case when we have 

We write 

{ V*(*) + x lA z ) + • • • + *%(*) 1 *' 

where the iy y (*) are functions to be determined. First we have by substitution 
in (32) 

f [(?+(£ Ky)(f:*v*)) d* 

+ fe tr 'a i x 1 * , £x , T} (l (z)dz (« = 1 , 2 , •••,«). 

J lm 0 

The terms involve x°, x, • • •, X*" 1 . We propose to satisfy each of the equa¬ 
tions above by deleting in turn all but the terms in 

x• and (, = 0 . 1 . •••, 9 ) 

and then satifying each of q + 1 equations resulting. One obtains after the 
omission of a factor x* 

f‘■'"‘[(9+ !)*+'*+jlVuW* 

= £ f«**"•( £ £ r f 3vjf.*)) dz + 

By an integration by parts 

fe** lg &+ l w(z)cJz = [e xT *'-‘to(z)J — fc*"*dw(z), 

in which the bracketed terras are again cancelled by a subsequent choice of 
limits, we may transform the coefficient e*" 1 in the integrands so that it is inde- 
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pendent of x . Equating these coefficients we then obtain in a formal manner 
the system of n(q + 1 ) differential equations for the functions 17 . 

- [(} + 1 )* - a,] E - (? + 1 - « H, + Z 


J=\ 


(» = 1 , 2 , •••, n; «= 0 , !,•••, 7 ). 


These equations written in succession for s = q , q — 1 , • • •, 0 arc 

- [(? + !)* — a i] -* 1 = ■)* + £ ZrfJVj, (i -1, . 

- [(* + 1 > -*) %-• +1 rift? “ 2 V. + 5 

(#=1,2, 

< 4 °) . 

- k.+ 1 >-*] t+••••*• £ «-’r 

"(f+l>1. + £ £ //,*>;, (<-1.2. . 

/-I * + /»0 

The singular points of (40) are at once seen to be z = aj(q + 1), 2 = aj(q + 1), 

.. z = aJ(q + 1) and 2 = 00 , the last of which is clearly a regular singular 
point while the others are of rank q at most. 

As in the case 9 = 1 we can, however, make a given one of the points 
z=sa r /(q+ 1) a regular singular point by making a preliminary transformation 

. /1 /fi 

The new system (32), and hence also (40), will not be changed except that 

tfU . K? (. = 1 , 2 , 

will be replaced by 
respectively. 

These n systems of equations (40) (r =1,2, •••,») will, as in the cases 
q = 0 and q = 1, lead to Laplace integral formulas. 

We shall now prove that the system (40) has a regular singular point at 
z=* r l(q + l). To prove this fact we note that, in view of the preliminary trans¬ 
formation assumed to have been made, the system (32) admits a formal solution 
T\'\ T;\ •••, T'\ where 


T'? = e* r *+ l x , 'B ir (x) 


(* = 1 . 2 , •••,»)• 
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Now it is easily proved that if the system (32) has a formal solution of this 
kind, then the adjoint system of equations, 




will have a formal solution 

*,(*) = {C,+ &;>--+■■■} 0 = 1 , 2 , • • n) 


1 , 


in which not every C, (j = 1, 2, •••, n) is zero. If this formal solution be 
substituted in the adjoint system, we obtain the following set of linear equations: 

a r Cj = (XjCj (i = 1.2,•••,«), 

a,C“>-* J Cy+i:C i jt$ (j = 1 . 2 , • • •, n), 



From the first n of the above equations we infer that 


Cj = 0 C r * 0. 

Multiply now the n equations of the ( k + 1 )th line in (40) by C { (i — 1» 2, • • •, n) 
respectively, the kth set by C\ l) (» =» 1, 2, •••, n), and so on. Then add the 
first nk equations thus obtained. In view of the above relations between the 
&P one finds 

- k,+i).-c£ j <?,%=-*++- • ■+ 

in which L k (rj) denotes a certain expression linear in rj {J with constant coeffi¬ 
cients. Since = 0 (» * r). C r * 0, we may write this in the form 

+ (f+ iV^u 

(* = 0 t 1,2, •••,?). 


( 41 ) —£T — O', t: I + " + ’ 




We are now able to prove that z = <*J(q + 1 ) is a regular singular point. 
This will be proved if we show that when we solve (40) for the derivatives of 
fj.j in terms of the i j tJ themselves, the coefficients of the expressions obtained 
have at z = a r /(q + 1) poles of the first order at highest. The expressions for 
dijijdz (£ = 1 , 2 , • ••, 7i ) certainly satisfy this condition, as the first set of 
equations (40) show. In solving for drj i 9 _ x /dz (i = 1,2, • • •, n) we distinguish 
the cases i =|= r and i = r. Since in the first case drj. 9 _ x /dz appears with a 
coefficient — [(9 +1 )z —a.] not zero at z = a,/(?+ 1 ), it is clear that poles of 
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higher than the first order do not appear. To solve for drj r 9 _Jdz we use 
equation (41) for k = 1 and deduce the same fact. In this way we solve for 
drjjdzy drj. ^Jdz, • ••, dnjdz , each time making use of the appropriate 
equation (41), to prove that no poles of higher than the first order arise. Thus 
the statement is finally proved. 

The n(q + 1) exponents at z = aj(q + 1) are all zero excepting q + 1 of them, 
which are (- Z, - 1)/(? + 1),(- /, - 2 )/(q + 1), • • (- Z - q- 1 )/( 9 + 1). 

I have not found a direct proof of this fact, such as no doubt exists. An out¬ 
line of an indirect proof is found in the foot-note p. 462. 

We will have then a solution rf'j (* — 1, 2, • • •, n; j = 0,1, • • •, y) such 
that 

where [z — aj(q + 1)] is a function analytic at z = aj(q + 1). This 
leads at once to the formulas 

<«> ry(. - , J3? £*«(. - fjr)* 

where c y is a loop circuit about * — «,/(? + 1 ) so chosen that 
R {x ,+l [z — a,l(q + 1)] | is negative along the ray. These actually repre¬ 
sent solutions, as can be proved just as in the cases q = 0 and q = 1 . 

If (_ __ 1 )/(q + 1 ) is an integer, one may use solutions corresponding to 

the remaining non-zero exponents, none of which are integers. 


§ 5. Asymptotic representation. 

If one makes the substitution 

in the formulas (42) for j = r, there is obtained the expression 

Y''<=t r "'3f. fe'u>7* J £x'¥ , ' l (x-'-'w)dw, 
Je_ J-® 


where c' r is a loop circuit to to = 0. When one expands $ v (*~ ,_, w») and 
evaluates the successive terms by means of the formula 

JV Wto = (e~ w " r=i - e'^- 1 ) T(* + 1 ), 

there results for YV a sum of terms of the same form as in the expression Tp 
plus terras which approach zero as x becomes infinite.* 

• For a more complete consideration of a like point see the paper by Horn previously referred 
to, Matheuiatische Annalen, vol. 50 (1897), pp- 527-531. 
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In order to state the result concisely we shall modify slightly the conventional 
definition of asymptotic representation. 

Definition. We may say that Y i is asymptotically represented by T\' 
along the ray arg x — <£, or in symbols 

T^Tp % arg x = <f>y 

if for some sector including arg x = <f> as an interior ray and for all values ofm , 

[ AA 1 ?-" e 1 

in which lim e = 0 as x becomes infinite. 

The result is then that we have 

r?>- 7Y> 

along any ray for which R {x* +l [z — aj(q + 1)] J is negative. These expres¬ 
sions TV* are, of course, identical with the Ty> which occur in the formal solu¬ 
tions of (32).* 

The ray of the loop circuit toz=^/(^+l) may vary in direction as long 
as it does not intersect one of the remaining n — 1 finite singular points 
z *= <*;/(? + 1 )• It is a simple matter to give the actual sectors on which in 
general the above integral formulas hold. Denote by t,, t,, •••, t s 
N^n(n — 1 )(?+ 1 ) rays in angular order for which 

R {(«* —} -0 (i+*), 

and also write r y+ , * r, + 2ir . We assume these rays to be distinct. Further¬ 
more, the real part of some particular one of the differences (<*j — a *)» ,+l 
changes from positive to negative as we pass from the sector (t^,, r m ) to 
( t Denote the subscripts in this difference byj m and k m . The ray 

of the loop circuit c i has an argument intermediate between a certain successive 
pair of arguments 

ar g ( *t* - ***) and ar g ( - a *. ) 

for which k m , = k m =j. In this case R {x * +1 [z - Ojl(q + 1)] } remains nega¬ 
tive for 


T -< ar S*<V + ^+l' 

The formula (42) 

_ Y X -YV, Y t -TY , Y u = 

•At this point it is easy to see that the exponents at z = a,/(q + 1) have the values above 
stated. For by comparison of the Laplace integrals and the formal solutions we get these values as 
possible exponents, and no others. But in the special case for which P v (z) = 0, P u =p!< ’ + a,x n , 
it may be at once proved that the exponents have precisely these values. Then if the ooeffioiento 
of the polynomials are changed continuously, the exponents are also so changed and henoe retain 
the form speoihed. 
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represents a set of q + \ solutions fixed by assigning the sector in which the 
ray of the loop circuit is to lie. This formula is valid for any one of the q + 1 
corresponding sectors 

T m <argx<T„. + -^- v 

It is to be noted that m and m are any pair of successive values of m for which 

If now we pass back to the solutions of (1), we infer that there exist solu¬ 
tions y,,y,, •■•>y«°f (!) such that 

(<=l,2,..,n) 

within any given sector 

This result has also been derived in certain special cases by Horn,* who has 
used the method of successive approximation. 

§ 6. Characterization atz = oo. 

At this point we can readily give a characterization of the nature of the solu¬ 
tions of (32) and thence of the system (1) equivalent to it at x = oo. 

Theorem. There exist JV-n(n-l)(y+l) fundamental systems of 
solutions of the canonical system (32), 

ryj. na, •••, ra, 

(43) . 

TO. to. • ••, TO, 

sucA that 

TO-7y\ r m — argx<% +1 , 

anrf *ucA Mai 

TO-. - TO 

ns., - w+ a. 

provided we write Y\f s + X = c 2 *** _l -Pi/’. 

Proof. In order to avoid confusion we will state at the outset that the solu¬ 
tions (42) are not in general the solutions referred to in this theorem. The 
latter retain their asymptotic form throughout minimum sectors, while the former 
retain it throughout maximum sectors. 

•Crelle’s Journal, voL 133 (1907), pp. 19-67 


(m^l,2, 


(>+>) 
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An immediate conclusion of the results of the preceding paragraph is th(U 
there exists for every ray arg x = <f> a set of solutions Yf, Y'{\ •••% 

(J = 1,2, • • •, ») such that along this ray 

Tf. 

In virtue of this fact it is possible to separate the x-plane into some finite num¬ 
ber of closed sectors <r, in each of which there exists a fundamental system of 

solutions Y\K IV.—. *.—.») su ‘ h that for * 

in <r. We choose these sectors so that the rays arg x = r m are included at 

most one within each sector a . 

Throughout a sector tr not containing an inner ray r. every solution 
Y f , Y t% •• •» Y n of (32) has a definite asymptotic form. For the general solu¬ 
tion of (32) is 

(44) Y t - c, Yf + c, Yf+ ... + cYf (. = l. 2, •••, »), 

which gives the asymptotic relationship 

r < -c 1 7r + c,7y>+... + c n T;\ 

The relative magnitudes of these n terms as x becomes infinite in a clearly 
depend on the relative magnitude of 

(45) 1 ?(«,«*♦*), •••. 

This does not change except at the rays arg * — r,, r t , • • •, r H . Suppose for a 
moment that indices 1 to n are so chosen that the quantities (45) appear in order 
of descending magnitude. If c k is the first of the quantities c,, c a , • • •, c n not 
zero, one has clearly for x in a 

Furthermore, since the successive regions <x have a ray in common, every 
solution Y x% Y 2 , •••, Y n preserves its asymptotic form throughout successive 
sectors a until one which contains a ray t ,, t ,, • • •, r, v is reached. We may 
therefore unite into a single sector the region a which contain a given ray r m , 
and all those which succeed it in angular order up to the one which includes 
the ray t^, , and affirm that on each of these sectors a m there exists a 
fundamental system Yf, Yf, • • •, Yf (j = 1,2, • • •, n), for which Yf ~ T'f 
throughout the sector. 

Next let us determine the character of the general solution Y x , Y t , • • •, Y n 
in one of these sectors tr m which includes within it the ray arg x = r m . Here 
again the relative magnitude of the quantities (45) determines the important 
term in (44). But i?)P ;<< (x)( and i?{P 4 m (x)) change order of magnitude 
along arg x = t w , and moreover 

R {(«>.- O^'Jc 0 
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If we suppose indices so chosen that the quantities (45) are in order of magnitude 
at the first boundary ray of <t b , two successive terms of (45), say the ath and the 
(s -f- l)th, will change order of magnitude for arg x = r m . We see by (44) 
that the solution Y l% Y 2 , •••, Y m will preserve its asymptotic form in a m unless 

c, = Cj -= «_, = <>, c, + 0,c, tl + 0. 

In this case the solution is of the asymptotic form 

at the initial ray of the sector, and of the asymptotic form 


. , . 

at the terminal ray. 

If, however, we have two solutions such that at the initial ray of the sector 


r.-'.TV 

r;~ e ; t ,rr" 


<i= 1,2 ,■•,«), 


we can always choose a combination Y” = Y t + A Y[ so that (Y\\ Y' t \ • • •, F’") 
preserves its asymptotic form c t T\‘\ • • - , e a T 1 ^ throughout the sector 

c m . For we have 

- e.r’V’+ u,y'r "+■ • •+ 


The required value of A is — c #lfl /c' +1 . 

Let now Y\\\ Y[f, •••, Y\g (j = l, 2, • ••, n) be any fundamental sys¬ 
tem of solutions of (82) such that 

Y[f~ T? 


for x on <r,. All these solutions will preserve their asymptotic form throughout 
<r 2 except perhaps the solution 

Y%\ Y<£' 

in view of the preceding discussion. However, in this case we can choose a con¬ 
stant A , so that the solution 

I",r + A,r*\ T<ff+A,r*\ry + 

preserves its asymptotic form 

throughout the sector <r 2 . Therefore if we write 

r</'=rtf (i+A). 

YVf-rff + A t r« 
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the new fundamental system of solutions Y\i\ YU\ ..., Y'£ (j= 1,2, • • •, n) 
preserves its asymptotic form throughout the sector a,. 

Passing now to the sector <r 3 , we obtain a third fundamental system of solu¬ 
tions, and so on. Thus we define Y*£ % i r ‘ l a\ •••, Y\ J ‘x +l successively in terms 
of JTf. 

These fundamental systems of solutions have therefore the desired properties 
stated in the theorem except that we need not have 


-T</?.v + , = Y'f 


(ij= 1 . 2 . 


These relations, too, will be satisfied when a proper choice of the initial funda¬ 
mental system 

(46) *TP. Y$ % Y% (y = 1, 2, •• •, ») 

is made. To prove this fact we show first that the final fundamental system 

of solutions 

(47) i» • • •» r<*, + | (j = i, 2, • • •, n) 

is entirely independent of the choice of the initial system (46). 

Let 

1,2, •• •, n ; m = 1,2, • • •, A’) 


be a second set of solutions of the same kind; the constants which correspond 
to A m will be denoted by B m . 

Let /?(a* ,+l ) be that one of the quantities (45) which is least in 
magnitude in (r,, t 2 ). There can only be one solution of (32) asymptotic to 
Tf% Tf y •••, TV on (t,, t,), and we must have 

(48) (« = 1,2,. 


in this sector. Now each pair of the quantities (45) become equal for 2(q -f 1) 
values of r, 

*■ . , 2tt , 2.0 -1 

* T + “Vi* •••, T + —-7T. 


T\ T + 


g+1 


7 + 1 


q + l 


It is clear, then, that on the sector 


(«) T - S X < T, + = T„ ( f = n( " 2 -' ) ) , 

each pair of these quantities (45) become equal once and only once. The quan¬ 
tity i?(ax ,+l ) will therefore increase over the remaining quantities (45) as 
argx changes from r, to t, -f Tr/(q + 1). It is therefore clear that 

YX=Y<;}=... = Y‘;;y 

zv= Z$=... = z%. 
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Hence we find 

y= Z.m ( 1 = 1 , 2 , ••,».) 

for all values of m less than n + 1. 

Next let I?(a i x 1 * 1 ) l>e that one of the quantities (45) which is second in 
magnitude in (r,, r 2 ). We must have in this case 

z:i = v;: + cy<z 


for m = 1,2, • • •, until a value m = 0 is reached for which 1 f (^a^* 1 ) increases 
over It must be remembered that R(a l X 7 * x ) increases over all 

the quantities (45) except R{a t v*+ l ) in (r, t tJ. Now in virtue of the 
relations 



nt + A.r 


(»» 

.#» 


!'<•> — V 1 ** 

J J i#» 



ZS + B.Z'S, 


z::.„ 


— Z**> 


the preceding equality for ni = 0 may l»e written 

(50) z;:,. x - = J'IV. - + «*¥•♦!• 

Also since R(a^* x ) is greater than li{a t *" x ) in (r # , t #+ i ) we conclude that 


We have already shown that 
aud we obtain from (50) 


- B t = -A t + c. 

vm — Z<*> 

1 V#+l “ ^ 1 . 0 * 1 * 


y<n = Z <0 


Since for ro > 0, the quantity I{(*'T'+ X ) will increase over the remaining 
quantities (45), we infer that 

Y ( S >- ZZ «= 1 , 8 ,- 


for all values of m greater than 0. 

Likewise we have 

^=n , + cr; + rfr;,\ 


where Rf^x 9 * 1 ) is the third of the quantities (45) in order of magnitude. 
The relation 


z;i = rs + era + <fra 


will hold for successive values of m, the constants c and c/ only changing for a 
value of m such that one of the three quantities 

/?(«.*"'), 

changes order of magnitude at arg x = r m . If the first increases over the 
second, the constant d may change; when the second increases over the third, 
the constant c becomes zero; when the first increases over the third, the con- 
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stant c becomes zero. From and after this fixed value of m < 4- 1* f° r which 

the first increases over the third, we have 

ITS-*<2 


In tins way we prove that after some fixed value of m</*+l we shall always 
have 

17J-«s2 ••,!»). 


whatever be the particular value of j. Therefore we have 

ne-s/j (,.,•=i,*,.»,«) • 


for m = /x, and of course for greater values of m, in particular m = N+ 1. 

Choose rtf to be this fixed set Y J'i,., multiplied by e~ Zm, '^~\ and all the 
conditions of the theorem will be satisfied. This is a possible choice, since T 1 /' 
after a complete circuit of x = oo becomes and therefore we have 

Y/' N + l ~ in (t,, t,). This completes the demonstration. 

The theorem whose demonstration has just been completed leads at once to a 
like theorem concerning the solutions of (1). 

Theorem. There exist n(» — 1)(? + 1)fundamental solutions of( 1), 



MM 

MM 

• ••. MM 


MM 

MM 

■■■•Ml 

such that 




'Ml 

~ ST, 

7 

,£ar;/x< 

and such that 





Mi*, = 

• Ml 



MM = 

■ MM 

+ A.MM, 

provided ice write yJ/J V i ! 


M,'- 



(m~ 1,2, •••, A f 4- 1 


i+l» 


O' + J-) • 


Tins is a complete characterization of the solutions of (1) in the vicinity of 
x = oo , since any system of functions Y'/J satisfying these conditions will form 
the solutions of a system of equations (1). The characteristic constants which 
determine the nature of the solutions are the n (q + 2 ) exponential constants 


a j* @j' ‘ K j U— 1. 2, • n), 

and the n(w — l)(fl + 1) transformation constants , A 2 , • • •, A >v . Not all 
of the constants A n are independent, since instead of the particular n formal 
solutions used we might have multiplied each of these by an arbitrary constant. 
The new transformation constant would then be altered to 
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Two sets of transformation constants thus related are considered to be the same. 
Any two systems (1) and (I) with the same characteristic constants (except that 
k. and k. may differ by integers) may be proved to be equivalent at x = oc, and 
conversely. 

The total number of essential characteristic constants is accordingly 
n(j + 2) + a(n-l)(f + !) — (»—!) = «*(? + 1 ) + 1 » 

But we can dispose of « — 1 further constants of (32) by the transformation 

= « = 1 , 2 ,- 

It is clear then that the canonical system of equations (32) also has at most 
n J (»y + l) + l essential constants. If it be admitted that no further reduction 
of (32) is possible, we infer that the characteristic constants arc not connected 
by any necessary relation. 


§ 7. Generalized lliemann / iroblem. 

We now propose the following generalization of the Riemann problem: To 
construct a system of n linear differential equations of the first order with pre¬ 
scribed singular points 

x,, x Jf •••, x m , x m + x — oo 

of respective rank 

7s’ '/.♦ i 

and with a given monodromic group , the characteristic constants being assigned 
*or each singular point. 

Consider the system of equations 


(51) 


dx 


-£(£SV-"j + SM* 


(.=rl, 2, 


with singular points x,, r 2% • • x^, x w „ = oo of prescribed rank </,, q v • • 

In this equation appear 

«'(£?,+ 2 ™+i) 

arbitrary constants. 

In order to fix the problem we consider the fundamental system of solutions 
(y'i", ..., y 1 ’), • •(y‘J\ • • y'f) such that t/f at some point x = a reduces 
to S.j , and take the group of this particular fundamental system of solutions 
to be assigned. 

The monodromic group contains n 7 m arbitrary constants, n 2 for each funda¬ 
mental substitution. Furthermore, the characteristic constants are 


Z ["’(?, +1) + 1] 
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in number; but between these constants and those of the group there are 
w(m + 1) relations, since the n exponents at each of the m + 1 singular points 
are determined either by the group or by the characteristic constants. 

We must further prescribe at each singular point precisely which solutions 
are to correspond to the canonical fundamental system of solutions given in the 
second theorem of § 6. Now the group itself determines the correspondence 
(since it determines the exponents) except for n multiplicative constants. There 
are accordingly 

(** — l)( w + 1) 

further conditions. Thus there are in all 

n*m + £ [n*( 7 , + 1) + 1 ] - n(m + 1) + (» - 1 )(m + 1) 

(■i 

conditions to be satisfied. 

The number of conditions equal the number of constants at our disposal. 
This is the basis on which we postulate the |>ossibility of the general problem. 
TnE University op Wisconsin. 
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A SIMPLIFIED TREATMENT OF THE REGULAR SINGULAR POINT* 

BY 

GEORGE D. BIRKHOFF 

The usual treatment of regular singular points involves the substitution of an 
infinite series in the differential equation; the series thus formally determined 
is then proved to be convergent. It is possible to replace this part of the work 
by a simpler and more direct argument, and this is given below in IV, along 
with an outline of the entire discussion, f The idea employed is quite 
similar to that used by Liapounoff in another connection (see Picard, TraiU 
d*Analysts vol. 3, pp. 362-363). The analysis is made in full only for a single 
equation of the second order, and afterward the generalization is indicated. 

I. We will assume that it has been proved, say by the method of successive 
approximations, that the general solution of 

(1) y" + />(*)y' + ?(*)y-0, 

at a point of the x-plane where p(x) and ?(x) are single-valued and analytic, 
has the form 

y - c .y. + 

in which y, and y, are linearly independent and analytic at the point. 

II. Now let * = a be an isolated singular point of one or both of the single¬ 
valued analytic functions p(x) and ^(x), so that by definition x =* a is a 
singular point of the differential equation. Consider the solutions y of (1) in 
a certain vicinity of x =* a which is taken so small as to contain no further 
singular points; these solutions are analytic except perhaps at x = a , by I. 
If we make a positive circuit of x = a the solutions will undergo a linear sub¬ 
stitution, which we will write in the form 

y. “ c .y. + c »y*> 
y, = d xV, + <*,y>- 

There will exist multiplicative solutions, y = py, for values of p such that 

c, — p d. 

= 0 . 

C. d,-p 
• Presented to the Sooiety, February 20, 1910. 

f Scblxsinobb baa avoided this aubatitution (Journal JUr die reiru und angevxuJle Malhcmatik, 
vol. 132 (1907), pp. 247-254) but bia method ia not nearly so simple aa the one here presented. 
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Thus we are led to the conclusion that there are two linearly independent 
solutions of the form 




_l°g/>i 
1 “ 2lV ’ 


( 2 ) 





or in a degenerate case, />, = />,, of the form 

(3) y, - (* - «F/i(*)» ^ lo g( x - ") + (* ” «)Vt(*)5 

here / and /* 2 are single-valued and analytic in the neighborhood of x = a, and 
can therefore be expanded in Laurent series at this point. 

III. Regular solutions are now defined as solutions of the form (2), (3) in 
which/,,/ 2 are either analytic or have a pole at x = a. A necessary condition 
that the solutions of (1) be regular at x = a is found at once to be that p(x) 
and r/(x) cannot become infinite to higher than the first and second orders 
respectively at x = a % by writing (1) in the form 

_i_ } y" y' y j 

>•' y; y\ y, =o. 
yi ft y: y\ y t 


If the coefficients y>(x) and g(x) have this character at x = «, that point is 
called a regular singular point of (1). 

IV. Conversely , at a regular singular point the solutions must be regular. For 
write 

*-(*-«)y • 

The differential equation (1) has as its equivalent the equations 




z=(x-a)y" hy =-(x-a)[p(x)y+ q(x)y] + y 


Consider now the real positive function 




tf=|r| + |r| 


for values of x on any ray emanating from x = a, and denote the distance 
|x — a | by r. Since, if y = a + i/9, one has / j = er -f /9 2 , it is apparent that 
both terms of U , and therefore U also, are continuous functions of r with con¬ 
tinuous r-derivatives. Moreover the inequality 
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5 ^ 12^1 + 12^1 

holds, inasmuch as, for any function w which is analytic, one has 

w = He 91 , H — I u> I, 


and lienee 


mt[ d R , • T? d • dw 1 

T, “ e [v + ,n <vj’ W= J X = ?• a,' 


a«pj 

dr i I <V* i 


-KI- 


But in the two equations (4) the coefficients of y and z have poles of at most 
the first order at x = a, so that clearly one has 


W\> Kl * 7(|y| + l*l). 


where AI is independent of the particular ray under consideration. Accordingly 
there is obtained 


This may be written 


'dU <2,1/1, . I* AMU 

>1= v|w + ,*,| s —• 

log U^ 4J/ 

” r " dr " r ' 

Integrating between r and r # (r — r 0 ) one obtains 

— 4,1/ log y S log ^ = 4 ..1/ log 


so that 


This gives the inequality 




Now is the value of U at a distance r 0 from x = a. Let G denote the upper 
limit of such values for 0 = arg(x — a) < 27r. Then we have 


By definition of £7 it is clear then that 


[0<arg(x-a)<2tr]. 


[0^arg(x-a)<2T], 


Trans. Am. Math. Soc. 14 
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i. e., the branch of y corresponding to the condition 0 ^ arg (x — a) < 27r 
becomes infinite only to a finite order at x = a. 

From equations (2) and (3) it is now concluded that the single-valued func¬ 
tions f and fi, analytic except perhaps at x = a, have also this property. 
Hence, a positive integer m can be chosen so large that (x — a) m fi and (x — a) m f 2 
are finite in the neighborhood of x = a. By Riemann’s theorem these func¬ 
tions are analytic at x = a. Therefore f and f are analytic or have poles at 
x = a % which we desired to prove. 

V. The formal determination of the series (2) and (3) still remains to be 
made, but for many purposes this expansion is not necessary. The usual dis¬ 
cussion shows that there exist essentially two and only two expansions ( 2 ) or 
(3) which formally satisfy (1), and these must, of course, coincide with the 
desired series. 

Part IV of the above treatment (as well as the remainder) admits of exten¬ 
sion to a single equation of the nth order in y,, or n equations of the first order 
in V\> Vv ‘ ‘'» V *» * n ^e case > 9 always reduced to the second by a 
substitution 

y, = (x-a) v [, .... 

If we write the linear system as follows: 



d Vj 

dx 


t><,(x) y> 


(• = 1 , 2 , 


n), 


the coefficients a^(x) have then at most poles of the first order at x = a if 
* = a is a regular singular point. Hence one finds in a certain vicinity of x = a 


M*)l 


If one considers the positive real function 


M 

• 

r 


tf-ijiyn 

one finds 


by (5), 


The remainder of the argument is now made as before. 
Princeton University. 


dU 


a-r£ |2y ‘ y: 


"(M- 


2 nM 


u. 
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A Theorem on Matrices of Analytic Functions. 


By 


Georoe D. Birkhoff of Cambridge, Mass. (U. S. A.). 


A function I(x) which is single-valued and analytic for \x\ suffi¬ 
ciently large, but which is not analytic at x — oo, may be expanded in 
a Laurent series in the vicinity of x — oo. This series consists of the 
sum of a function a(x) analytic at x — oo and vanishing there, given by 
a power series in negative powers of x, and of an entire function e(x), 
expressed as a power series in positive powers of x beginning with a con¬ 
stant term; that is, we have 

(1) {(*) - a(x) + e(x). 

Thus l(x) is decomposed into a sum of two functions, one of which 
is analytic except at x —oo, and the other analytic at x — oo, vanishing there. 

If the additional restriction be imposed on l(x) that this function 
does not vanish for x sufficiently large, we can similarly express I(x) 
in the form of a product 

(2) l{x) - a(x) e(x) x*, 

where a(x) is some function analytic but not zero at x — oo, where e(x) 
is an entire function that nowhere vanishes, and where k is an integer. 

To prove this statement we note that logf(x) is analytic for \x\ suffi¬ 
ciently large. Furthermore logl(x) alters only by a constant — l,k 

being an integer, when x makes a positive circuit of x — oo. Accordingly 
the function log l(x) — A* log x is single-valued and analytic in the vicinity 
of x ■= oo, and by (1) can be expressed in the form a x (x) + e t (x) where 
a x {x) is analytic at x — oo and e,(x) is an entire function. If we put 
a(x) = e' ,(z, , e(x) — e*« (x) we have at once the relation (2); the functions 
a(x) and e(x) have the specified properties. 

Suppose now that (l if (x)) is a square matrix*) of w* functions, each 

•) The notation and the elements of the theory of matrices used in the present 
paper may be found in Schlesinger, Vorlesungen Uber lineare Diffcrentxalglcichungen, 
p. 18—19. 
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single-valued and analytic for |s| ^ 22. Each of these functions may be 
expressed as a Laurent series for |x|^>i2. The generalization of (1) is 
trivial in this case and may be written 


(hjW) = (««<*>) + 

the decomposition ( 1 ) being applied to the separate elements of the matrix. 

The generalization of (2) for such a matrix is of an entirely different 
nature. I was first led to a theorem effecting such a generalization by 
the study of the singular points of ordinary linear differential equations.*) 
The present article contains a completed form of the theorem and a proof 
of it based on the theory of the Fredholm integral equation. The theorem 
seems to possess an intrinsic interest quite aside from the interest of this 
application to ordinary linear differential equations. In the article which 
immediately follows the present one I have carried out the application 
The theorem is as follows: Let (l fJ (x)) be any matrix of functions 
single-valued and analytic for \x\ ^ R (but not necessarily analytic at 
x — oo), and such that the determinant of this matrix does not vanish for 
\x' R There exists then a matrix (a tJ (x)) of functions analytic at x — oo, 
reducing to the unit matrix (d tJ ) at x — oo, and a matrix (e tj (xj) of entire 
functions, of determinant nowhere zero in the finite plane, such that 

(3) (*» (*., <*)**>) 
where k lt k t , • • •, k n are integers. 

In demonstrating the truth of this theorem let us begin by proving 
the existence of a solution (e t/ (x)) of the matrix equation 

(4) (l tJ (x)) («„<*)) - (&,/*)) 

where (e if (x)) is a matrix of entire functions of determinant not identi¬ 
cally zero to be determined so that the elements of ( b t j ( X )) are rational 
in character at a; — oo (i. e. are analytic or have only a pole at x—oo). 
Otherwise stated, we shall prove the existence of n sets of entire functions 
*' V *«(*) determinant not identically zero for each of which 
we have 


(5) * ( .(*)«i(*) + ---+ t im (x)t.(x)-b,(x) (»— 1, • ••,»), 

fc,(r), ' ‘ i b H (x) being rational in character at x — oo. 

Let (tn tJ (x)) denote the matrix inverse to (l i} (x)) and defined by either 
set of equations 


(6) 


I ( ,(x) !*„(*) + • • ■ + /,.(*)»./*) - S„, 

m a( x ) >,/*) + • • • + '»,.(*) - s,„ 


(i.j-1 , 


•) Trans. Am. Math. Soc. 10(1909), p.436-470; in particular see p. 438—443. 
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The functions ni tJ (x) are clearly single-valued and analytic for [x'^li since 
the determinant of ( l tj (x )) does not vanish for such values of x. 

Let us put also 

(7) »»,,(*)»,,« + • • ■ + m lm (n)l nJ (l) - <5„+ *„(*, 0 (i,j= 1, 

and in this way define k tJ (x, t) for |:r| ^ J?, From the second 

set of equations (6) it follows that each function 

*«/*• 0 
t—x 


is analytic in x and t for \x\ ^ R, \t ^ R, since k tJ (x f t) vanishes identi¬ 
cally for x — t. 

From the above equations and (7) we deduce further the equations 

(8) /„(*)*,,(*, f) + • • • + *,.(*)*./(*> 0 + W*) " WO W - 1 

which may also serve to define the functions k tj (x, t). 

Now consider the set of n linear integral equations in f x (x), • • •, f n (x) 


(9) 


l 

2 nV 






dt - f t (x) - ^ m <i(*) Pii*) (•'- V • »0> 




where • • •, p n (x) are any polynomials in x t and the variables x, t 

are restricted to lie on the circle C for which jx| — P; the integration 
around C is taken in a positive sense. 

If we set :r — Re^ 9 , t — where 0 and <p are angular variables, 

and put 

1 kJae^.R^*) 

rw -W- Dx+e, 2(j— i)* + ip) (o<o, 9 <2*), 

- F(2(i-D* + 6 ) (0 < 0 < 2 r), 

*» 

^ - P(2«-1)* + 0) (0< 0 < 2*), 

according to a device due to Fredholm, our n equations may be combined 
into the single one 

I nn 

(10) / K(8, <p)F(tp)dtp - F(0) - P(0) 


where 2^(0, q>) and P(0) are continuous save for the discontinuities of a 
finite jump. This equation is of precisely the type of linear integral 
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equation studied by Fredholm*), and according to his theorems admits of 
a solution F{0) integrable in the sense of Rieraann, at least if P(0) 
satisfies k conditions of the form 

t*n 

fp(e)i> t (0)de- o («— !,•••, k). 

o 


Here tf» x (0), • • 4> k (0) are certain integrable functions of 6. 

Returning now to equations (9), we see that these equations are 
satisfied by a set of integrable functions f x (x), • • •, f n {x) for x on C, at 
least provided that the coefficients of the various powers of x in the 
polynomials p x (x), • • •, p m (x ) are subject to certain linear homogeneous 
conditions. Such a solution f(x), •• •, f u (x) is made up of functions ana¬ 
lytic in x on C, for the left-hand member of any equation (9) is analytic 
in x, and hence the first term f(x) in the right-hand member is also 
analytic in x for i — 1 , 2, • • •, n. 

For the equations (9) serve to define f x (x), • • •, f n (x) as 

single-valued analytic functions in terms of these functions on C. Thus 
we may regard the equations (9) as satisfied for , x | ^ R by such a set 
of functions. 

Let us restrict x to lie outside of C and let us consider one of the 


integrals 

( 11 ) 


i r (w dt 

tnY=lJ 




As we have seen, a function f(x) may be expressed as the sum of a 
function a t (x) analytic at x — oo and vanishing there, and of an entire 
function If we replace f(t) in the above integral by this sura 

a ( (t) + £,(*), the integral breaks up into the sum of two others 






/ ..wdt 

t - x 


The second of these partial integrals has the value zero by Cauchy’s 
integral theorem, since £,(<) is analytic for The transformation 

x = ^ 7 , t = y reduces the first partial integral to the form 



where C' is the circle 



described in a positive sense. 


Now the 


•) Acta Math. 27 (1903), p. 365—390. 
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function ja ( ( |-) is analytic within and on C', since a,(x) = f(x) - t { (x) 
is analytic for [a: and vanishes at x — oo. Hence, by Cauchy’s inte¬ 

gral formula, the expression in brackets reduces to \ a t . The integral 

(11) is therefore equal to — a t (z). 

If we add the integral (11) to both sides of the I th equation (9) for 
* = 1, • • •, w, we obtain the following equations 

2 *V-iJ **-*£*) — (*-l,•••,*)• 

c )ml 

Multiply the first of these equations by l n (x), the second by l n (x), and 
so on, and add the n resulting equations. By the aid of the equations 
(8) and (6) we may reduce the resulting equations to 

tnV—i J ~T=m~ dt ~2 l «/*)*>(*)-A(*) 

c 

The left-hand member of such an equation is clearly a function a t (x), 
analytic for |x| > 2? and also at x — oo where it vanishes; hence we have 

(12) *,,(*)«,(*) + • • • + l, H (x)t m (x) - Pi (x) + a t (x) - &,(*) (j - 1,.. n) 
where b t (x), • • •, b n (x) are rational in character at x — oo. 

We are thus led to formulate the following conclusion: if each 
function b t (z) in (5) is rational in character at x — oo, and is represented 
by the sum of a given polynomial p 4 (s) in x and of an unspecified function 
a i(x), analytic at x—■ oo and vanishing there, a set of corresponding entire 
functions *,(x), • • •, f„(x) will exist satisfying (5), at least provided that 
the coefficients of the powers of x in the polynomials p x (x) t • • •, p n (x) 
satisfy a certain set of k linear homogeneous conditions. 

In order that the existence of a solution of the corresponding matrix 
equation (4) be established, it remains only to be shown that it is pos¬ 
sible to choose n sets f, (x), • • •, f m (x) of determinant not identically zero. 

But this determinant will be zero if and only if the determinant of 
the corresponding sets b l (x) t • • •, b u (x) is zero, since the product of the 
determinant (not identically zero) and the determinant of the sets 

f i ( x )f * • •, t n (x ) is the determinant of the sets 6 t (x), • • •, b n {x). 

If the polynomials p t (z) are all of degree m or less, this last men¬ 
tioned determinant may be expanded in descending powers of x, the ex¬ 
pansion beginning with terms of degree mn or less in x. The term in 
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x mn here is clearly precisely the same as the corresponding term in the 
expansion of the determinant of the n sets p,(x), • • -, p„(x). It follows 
that if the determinant of the sets f, (x), - • t H (x) is to be identically 
zero, the determinant formed by the coefficients of x**" in the n sets 
V\ (*')» ‘ ’ '» PuC*) n,usfc vauis b- 

But each set of polynomials p t (x), •• - f p n (x) is subject to the same A 
conditions in order for a solution of (5) to exist. These conditions have 
been seen to be linear and homogeneous in the constant terms and coef¬ 
ficients of x, x * • • •, x"* occurring in the polynomials. It may be assumed 
that, as m increases indefinitely, these k equations of condition do not 
remain linearly dependent; otherwise all of the A- rowed determinants 
formed from the matrix of the A equations vanish, and a customary 
algebraic reduction in the number of conditions may be made. Choose in 
so large that the equations of conditions are linearly independent; if then 
tn is changed to mi + 1, it is clear that the new unknowns introduced 
may be taken entirely arbitrary. Hence the determinant of the leading 
coefficients referred to above need not vanish, although all the n sets 
Pi (*)»•••> Pn( x ) satisfy the A conditions. A solution of (4) with the 
specified properties must therefore exist. 

This completely establishes the first and most important step in the 
proof of the theorem. 

The second step is to 6how that by three series of modifications the 
solution of (4j may be made to have the three following additional pro¬ 
perties: first, that the determinant A(x) of {f tJ (x)) is nowhere zero in the 
finite plane; secondly, that (b tJ (X)) has the form (a lf (x)x*i), where x,,«*«,x w 
are integers such that x, <> x. < • • • x„ and (a 4> (x)) is a matrix of func¬ 
tions analytic at x — oo of determinant not zero there; thirdly, that for 
a suitable rearrangement A, , • • A„ of x,,- -, x„ every function t tJ (x) 
vanishes at x »= 0 at least to the order Xj — A 4 if A. < x f . 

Let us observe that A(x) is an eutire function which vanishes only 
a finite number of times in the finite plaue. In fact A(x) only vanishes 
a finite number of times for |x| ^ J{ since this function is then analytic. 
Also it vanishes a finite number of times for x ^ ](\ for the product 
of the determinants of |/, 7 (x)| and A(x) - |«, 7 (x)| is | b xj (r) , analytic for 
x ^ Ji and rational in character at x — oo so that b {j (x) only vanishes 
a finite number of times for .x ^ I{. 

We may replace the solution (« t7 (x)) of (4) by a new matrix solution, 
and at the same time remove one of the zeros of A(x), these being counted 
with their proper multiplicity. Suppose that A(x) vanishes at j = a. In 
this case we may choose a set of constants c n , • • •, c Ntf not all zero, so that 

O 3 ) + = 0 (i= 1,2, 
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Choose now further e t9 ,--- 9 c m9 , c xl , • • •, c, s> in any 

way so that the determinant \c t) is not zero. If we multiply the equa¬ 
tion (4) through by (c tJ ) on the right, the matrix (f 4/ (x)) is replaced by 
(e 0 <x»(c 0 ). From (13) we infer that the elements of the first column 
of this last matrix all vanish at x = a. The right hand side still consists 
of a matrix of functions rational in character at x — oo. 

In this equation 

[(«,/<*')(*./)] - (&,>(*))(«</) 

we may remove a factor x — a in the first column of the matrix forming 
the left-hand side by removing it from the first column of [(*,,(£))(c (y )J. 
This operation changes [(*,/£)) (c ( ^)] to a matrix (f^(a?>) of entire func¬ 
tions, since all the elements in the first column of [(«,j(g)) (c 4/ )] vanish 

at x — a. Furthermore the determinant of is clearly The 

same operation of division replaces (b if (xj) by a matrix (&',/*)) of func¬ 
tions rational in character at x — oo. 

Thus we obtain a new matrix solution of (4), and its determinant is 
the same as that of ihe first solution except that a factor x — a, corres¬ 
ponding to the zero at x — a, has been removed. 

By a succession of such steps we may remove all of the finite set 
of zeros of A (a:), and finally obtain a solution (f if (x)) of (4) whose deter¬ 
minant A (a:) never vanishes. 

We shall now show that not only may we make A (a:) + 0, but also 
that by further modification we may reduce (b t ,(x)) to the form {a if {x)x r> ) 
where (a,/X)) is a matrix of functions analytic at x — oo, of determinant 
not zero there.*) 

To prove the existence of such a solution we note that in (4) we 
may make the same interchange of columns in («,/*)) and in (6 4/ (x)) to 
obtain new matrices (t tJ (X)) and yb' lf {xj) which satisfy (4); the deter¬ 
minants A(a;) and A' (x) — | e it {x) | are the same or at most differ only 
in sign. Likewise we may add to the elements of any columu of (f (> (x)) 
a sum of the corresponding elements of other columns multiplied by poly¬ 
nomials in x, and thus obtain a matrix («'^(r)) of entire functions whose 
determinant A'(ar) is equal to A(x); the corresponding matrix (b' i} {x)) is 
obtained by the same operation on the columns of (b 4 j(x)). 

We may start therefore with any matrix (b^ix)) corresponding to a 
solution (ttj(x)) of (4), and be led to new solutions for which (b^yx)) is 
modified either by a succession of arbitrary interchanges of columns, or 

•) Cf. Hensel uud Laudsberg, Theorie der algebraischen Funktionen einor 
Variabeln, p. 158—173. 
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by adding to any given columns a sum of multiples of other columns 
by arbitrary polynomials. At each step A(.r) remains unaltered or changes 
to — A(x), so that the property A(i) + 0 is preserved. 

These two transformations of (b {J <x)) suffice to give it the stated 
form. In the first place we may put in evidence in each column of 
(&,/*)) the lowesfc P ower of x which may be removed and leave all the 
elements of the column analytic at x - oo, by writing it in the form 

{a ti {x)x*i) where the functions a tJ (x) are analytic at 1=00 and do not 
all vanish for any fixed value of j. A proper interchange of columns 
makes x x ^ x t <[ • • • ^ x m . It the determinant of (a IJ (x)) does not vanish 
at x - oo our statement is proved, so that we can limit ourselves to the 
case when this determinant does vanish at x — oo. Since a tj (x) vanishes 
at x - oo, we may choose quantities c lt • • c mi not aU zero, so that 

a «»W c i + b a im (x ) c m =- 0 at r = oo (t — 1, 2, • • •, w). 
Let c a be the last of the quantities c lt • •c m which is not zero. On 
division of the above equations by c ai these take the form 

+ ’ * * + + « ia (x) = 0 at i-co (t - 1, 2, •••, n). 

Consequently if we multiply the first column of («„(*)*'') by the poly¬ 
nomial the second by the polynomial and add, 

element for element, the result to the column, that column becomes 

(a iX <x)d x + •• + a ia fx)) x*“ (/- 1, • • -, n). 

Hence we obtain a matrix (&;,(*)) the same as (b lf (i)) except for its « th 
column where we have a factor a?*- 1 in evidence since the expression in 
parenthesis vanishes at * - <x> for all values of i . That is, we have changed 
* a to x «” leaving x xt • • •, x a + 1 , • • x„ unchanged. 

We may proceed by a finite succession of such steps until this 
process comes to an end. For, the value of the determinant |6,/x)| is 
**, + ...+*„ I a if {x) and is unaltered in the process above given. If *, + "• + x„ 
could be indefinitely decreased , then |6„(:r)| would vanish to an arbitrarily 
high order at a = oo, and hence would be identically zero. When this 
process comes to an end, (&,/*)) has the form {a u (x)x*J) where a,/*) is 
a matrix of functions analytic at x = c© of determinant not zero there. 

A rearrangement of the columns of (e ;> (x)) and (6,/x)) may be made 
so that x, and the second series of modification is complete. 

Finally we make the third and last series of modifications which, for 
some arrangement k x , •••, k m of x lt •••, x„, makes e,j(x) vanish at x==0 to 
at least the order x f - k i for aU values of / and j such that k t < x. This 
reduction is to be effected by modifications which do not impair the two 
properties already secured for the solution of (4). 

Mathemitisclie Annalen. I.XXIV. 9 
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On account of the first of these properties the determinant ! | 

is not zero at x = 0. Consequently the (n—l)-rowed minor formed by 
striking some row and the column of this determinant does not 
vanish at x = 0; likewise an (n —2)-rowed minor formed by further 
striking out some row and the (» — l) tb column will not be zero 

at x — 0. Continuing in this way we obtain a set of integers a,,..., a n 
forming a permutation of the series of nntural integers 1, 2, •••,»! such that 
the determinant formed by striking out all the last n — p columns and 
the «^ + 1 ,h , • • •, <*„‘ h rows is not zero ut x — 0. 

Let us first consider the case in which we can take 

“i- 1. «t“2, <*„-« 

so that none of the series of determinants 


*!.(*)» 'llW'aW - ^(X) 

vanishes at x — 0. The argument in the more geueral case is then dis¬ 
posed of without difficulty. 

If we multiply equation (4) through on the right by a matrix 


(A/*)) 3 



Pit(z) 

1 


0 


P,>( x ) ••■?,„(*) 
/>»>(*) •••?..(*) 


where we have p if (x) -■ 0 for « > j, p ti (x) — 1, and where p it {x) for i <j 
is an arbitrary polynomial of degree Xj— x lt the matrix fa/x)) is replaced 
by another matrix («,/£)) (p t )(x)) of entire functions of determinant the 
same as that of (t tJ (x)) since we have !;>,/*) | — 1. Moreover 

(*««*» - (««(*)*'■') 

is replaced by (a iJ (x)z ,> j (p,/x)) which may be written 
^•«n (*)» x''(«„'X)p„(x)x'‘-'' + a lt (x)), • • 
x'-a.i (x), x'*(oji (x)y> lt (x) x'*-*• + a„ (x», • • 

x'-a„,(x), x'Mfl. 1 (x)p,.(x)x'--'-+ a.,(x)), • • 



This is a matrix of the form (a tJ (x)x’S) where every function a tj (x) is 
analytic at x — cc and where the determinant \a t j(x)\ retains its former 
value. Thus we have found a further modification which preserves the 
two properties already secured for the solutions of (5): namely, we may 
replace (*,/*)) bv («,/*)) (/>„(*)) and by (6,/z)) (p tJ {z)). 

Now consider the first element e u (x)p lt (x) + £,*(*) in the second 
column and first row of (*,/*)) ( p,j(P>). Since e n (x) is not zero at x = 0, 
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it is clear that by properly choosing the polynomial p„(x) we may make 
this element vanish to at least the order at *-0. 

Next consider the first two elements 

(14) .„(*)*,,(*) + *,.(*)*.<*> +«»(*) and 

in the third column of the same matrix. It is possibU to make botWf 
these vanish to the degree *,-«,bya proper cho.ce of the first x, , 
coefficients in the polynomials *,(*) and p a (x). For we have to make 
vanish the expresses obtained by putting * - 0 in the above and m 
the 1 P» irs of expressions obtained by sucessive differentiations 

at *_0 The first pair of equations determines the constant terms mj„(i 
and *,(*) since by hypothesis the determinant *„(*)«*(*)- *..(*> ««iW 
is not zero at x = 0. The second pair of equations determines the coef¬ 
ficients of the first powers of x in these polynomials, since these equations 
are linear in these coefficients with the same determinant as the first pair 
of equations. Continuing in this way we make both elements (14) vanish 
to the order x,-x, at x-0. Furthermore we may, in addition, make 
the first of these vanish to the degree x,-x„ by choosing properly the 
remaining x,-x, arbitrary coefficients in pjx) of degree x,-x,; we 
note that the coefficient «„(*) of />„(*) is not zero at r - 0 

Determining the functions p„{x) by a series of stops like the above 
„e may modify («„(*)) so that for any i <j the element «„(x) vanishes 
to the order X> - * ( at least at * - 0. That is, the solution will have tho 

stated third property if we take Jq “ x. — > x„ — /.„• 

If we cannot take 1, we can nevertheless by the 

same process make the second element in the row of («,/*)) Ob/*)) 
vanish to the order x,-x„ and then make the third elements in the 
u ,b and a,'" row vanish to the order x s — x, and x 3 — x. respectively at 
0, and so on. In other words, if A, , is the same permu¬ 

tation of x„ • • •, x. as «„ •••,«. is of 1, • ■ -, », then the modified (*,/*)) 
will be such that whenever we have x f > l\ the element («,/*/) vanish 
at least to the order x > — k\ at x — 0. 

This completes the reduction of the solution of (4) to a form pos¬ 
sessing the stated three properties. 

The third and final step is now made at once. On account of the last 

mentioned property of («„(*)), we may write it in the form (s'/*) **'' ') 
where any function i\ t {x) wiU certainly be entire if il x y <.fc, 

the function is also entire since it is the same as x ' ^f (/ (x). The 

determinant ^(g) is the same as that of J Hence *,■/*); is a 
matrix of entire functions of determinant nowhere zero. 

The matrix may be written id, aud 
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the matrix (a ii (x)x* s ) may be written (a,/*)) Consequently from 

(4) we obtain 

(»„<*)> (t,,*-' 1 ) «,<*)) (8„J>) - (a,/*)) fa,**'), 

whence we find 

(*,,<*» - ( a ‘/ z >) (»„**')• 

Since the matrix (fj/£)) _x is also a matrix of entire functions of deter¬ 
minant the reciprocal of | e]j(x) | and not zero for any finite x, this equa¬ 
tion may be written 

&/*» - 

where ( a if (x)) f (e tJ (x)), and k lf • • *, k u have all the properties specified in 
the theorem except that (a, 7 (x)) need not reduce to (d if ) at x — oo. However 
if (a^(x)) reduces to (c if ) of determinant hot zero it is possible to secure 
this final property for a„(x) by replacing (a„(x)) and (e (/ (x)x *') by 
(a,, 1 .x)) and (e„) (e.j(x)x^) 

respectively. Thus the theorem is demonstrated. 

One is naturally led to inquire to what extent the matrices (a,^(x)), 
(?ij(x)) f and the integers k l9 • • •, k m are determined for a given matrix 
(lij(x)). In the case n — 1 the explicit solution obtained at the outset 
is unique. 

For w>l, a first simple fact is that for a given £,,•••,&„ the 
matrices ( a if (x )), (e,/x)) are uniquely determined in case such matrices 
exist. In fact, if a second distinct choice (fl,/(x)) and (e if (x)) of these 
matrices were possible, we should have 

(a tJ (x)) (e t ,{x)) - (dij(x)) (€„(x)) 
or 

(5„ur))- 1 (a„(x)) - {c ti (x>) (e 0 (x»- *. 

But the matrix (rpo(x)) on the left side of this last equation is a matrix 
of functions analytic at x—oo and reducing to (d t j) at x —oo; and at the 
same time is a matrix of entire functions, as the right hand side of the 
same equation shows. Hence (<p,,(x>) must reduce identically to the unit 
matrix, and thus we have (a,>(x)) — (a (/ (x)) and (e,,(x)) — (e tJ {x)) contrary 
to hypothesis. 

Secondly, we observe that if a second choice (a„(x)) and (e,y(x)), of 
and (e^(x)), be at hand for a second choice f,, • • •, k n of k ly • • •, k nf 
then we must have 

*i H-+ k m = k t +-h k m . 

For we have the obvious relation between determinants 

*(x) -x* 1 ■*■■■■♦ *»|«u(*)| \etj(x)\ -x*» + - + *»|a. v (x)| \e tJ (x)\ 
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where -Mx) is clearly single-valued and analytic m the vicinity of x - oo, 
and does not vanish there. Also |a (> (x) and ««(*), are analytic at x - °o 
and do not vanish there, while | «.>(*) | and ?„(*) are entire functions 
nowhere zero in the finite plane. But »(*) admits of a separation into 
factors of this form in only one way, as has been noted. Thus we obtain 

the stated relation. _ . , 

For a given matrix (/„(*)), the applications that I have made elsewhere 

show that the values of *„ •••,*. are restricted by one or more relations 

of the above form. The general case appears to be that in which there 

is no other condition but the one obtained above, but in particular cases 

Jc ... k may be wholly determined. Thus the distribution of the possible 

sets of values of k lf • • •, *„ may be complicated, although for each set 

there is but one choice of («,,(*)) and (*,(*)). It is also not difficult to 

show that it is possible to pass from one solution (a,,(*)), (*.>(*» and 

to an 7 ot ^ er ^7 means of matrices of polynomials in — of 

determinant identically 1. That is, the transcendental part of the problem 
is completely solved when a single determination of (a,,(*)), (*/(*)) an(1 
JL, • • •, k n has been effected. 
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Equivalent Singular Points of Ordinary Linear Differential 

Equations. 

By 

Gf.oroe D. Birkhoff of Cambridge, Mass. (U. 8. A.). 


In a paper published in 1909*> 1 introduced the notion of equivalent 
singular points of ordinary Irnear differential equations. But the theorem 
of classification to which this notion gave rise was incomplete because 
certain exceptional cases were laid to one side.**) In the present paper 
I prove the theorem in its generality, and base ray proof on the auxiliary 
function-theoretic theorem whose demonstration is found in the immediately 
preceding paper in these Mathematische Annalen. 

Suppose that we are given a linear differential system 

* 

(1) * y x ‘ P‘l(. z )Vi (i-1,8, •••,») 

>-» 

where the functions p tf (x) are analytic functions of the independent 
variable x. Any finite point, x - a will be an ordinary point of this 
system if the functions p tJ (x) are all analytic at x - a\ the point x — oo 
will be an ordinary point if all the functions p, y (x) vanish to the second 
order at least at x — oo. Any point which is not an ordinary point is 
termed a singular point of the linear differential system; we restrict our¬ 
selves to singular points at which the functions p ff (x) are rational in 
character i e. are analytic or have a pole. It is no restriction to assume 
that the singular poiut under consideration lies at x = oo, and this we 
shall do. 

In the vicinity of x = oo each coefficient p if (x) may be expanded 
in a series of descending integral powers of x. If q is the greatest exponent 
of the leading power of x in any of these series, then q + 1 is the 

•) Trans. Am. Math. Soc. 10, p. 436—470. 

••) I.oc. cit. p. 446 and p. 463. 
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ran/.’ of the singular point x - oo.*) The case of rank 0 ( 4 - - D is the 

regular singular point. . 

Any linear transformation of the dependent variables in (1) 


( 2 ) 

j~l 

in which the functions a if (x) are analytic ati-co and such that the 
determinant |a 4 y (x)| is not zero at x - oo takes the given linear ditleren- 
tial system ( 1 ) into a transformed system 


(3) d ii-i M*>* 

of the same form. The explicit expression for f tJ (x) is given by the 
formula 

■ * ^ ^ 

(4) P„(x) -^«,»(*)/**.(*)55 

k.lm I tml 

(«>i-!,•••» >0 

where (a 4y (x)) is the matrix of functions inverse to (<?,/*)).**) The equa¬ 
tions (3) may be found by direct substitution. 

If two linear differential systems (1) and (3) are related by a trans¬ 
formation ( 2 ) in which the function « 4 > (x) are not only analytic at x-oo 
but reduce to d 4y at x - oo (d, 4 -l and d, y - 0 for !+./) we will say 
that the two systems have an equivalent singular point at x — oo.***) 
Since the transformation inverse to (2) is clearly of the same form, this 
relation of equivalence is a symmetric one. Likewise, since the composition 
of two transformations like ( 2 ) is another of the same form, the relation 
of equivalence is transitive. 

It is obvious from the form (4) of the coefficients p ti {x) that the 
rank of neither one of two systems having an equivalent singular point 
at x = oo can exceed that of the other, and hence that the rank of all 
such systems is the same. 


•) For these definitions see Schlesinger, Vorlesungen fiber lineare Differential- 
gleichungen, p. 181 . 

•*) For an exposition of the elementary properties of matrices used in the present 
paper see Schlesinger, loc. cit., p. 18—19. 

•*•) This definition is more satisfactory than the one used in my earlier paper 
(Trans. Am. Math. Soc., loc. cit.) where the functions a ; <x) were merely restricted to 
be rational in character at x = ». 


253 



136 


G. D. Bibkhofp. 


The importance of the notion of equivalent singular points lies in 
the fact that the solutions of linear differential systems (1) and (3) with 
equivalent singular points x = oo must be of essentially the same nature 
in the vicinity of x — oo since the coefficients a if {x) in the trans¬ 
formation (2) are analytic at i = oo. 1 have made application of the 
notion of equivalence in my paper in the Transactions of the American 
Mathematical Society. 

We now proceed to prove the theorem: Every linear differential system 
(1) with a singular point of rank 2+1 at x — oo is equivalent at x — oo 
to a canonical system of the form 

n 

(5) 0- 1, *,-,«•) 

J" I 

in which P tJ (x) are polynomials of degree not greater than q+ 1. 

Proof. Outside of a certain circle \x\ — R in the complex plane, all 
of the points x — a are ordinary points of (1). According to the funda¬ 
mental existence theorem for a linear system (1) we infer then that there 
exists a set of n linearly independent solutions of (1), 

(6) (y,i. • • y..), (jfii. • • •. y. 3 ). •••»(*..•••> y..). 

each element of any one of which is analytic for the general 

solution may be expressed as a linear combination of these particular 
solutions. The elements of these solutions are not in general single-valued 
since, when the independent variable x makes a positive circuit of x — oo, 
these solutions alter respectively to a new set 

(yin * * *» y*i)> (£it> ■ ■»y**)> ■ ■ ■» @i«» ■ ■ ’»y**) 

which are of course linearly independent. Each one of this set of solu¬ 
tions is linearly dependent on the first set of solutions; this relation 
may be indicated by the matrix formula 

(7) (y„) - (y,/) («»,) 

where (c (> ) is a matrix of constants of determinant not zero. Thus the 
set of solutions undergoes a linear transformation when x makes a positive 
circuit of x = oo. 

Now according to the well-known theory of such transformations it 
will be possible, except in special cases, to choose an initial set (6) of 
solutions so that the matrix (c t> ) takes the simple normal form 
i. e. there will exist n linearly independent solutions (6) such that each 
element of the j lb one of these is merely multiplied by a factor when 
x makes a positive circuit of x= oo. For the moment we will assume 
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that ( 6 ) is a set of solutions with this property. Later it will be seen 
that only a slight alteration is necessary to treat the special cases.*) 

Now define the quantities • • •, X n up to an additive integer by 
the equations 

1 j - lo 8 e, (j -1,2, • • ,»), 

and then define the functions ) by the equations 

( 8 ) Vij( x ) " *</*)*'• 

These functions are single-valued and analytic for since x> is 

multiplied by q } when x makes a positive circuit of x — oo. Moreover 
the determinant 

l - , *■ , *—■*- —•*-« w >- 

is not zero for \x \ ^ R**). Hence, by the auxiliary theorem above referred 
to, we may decompose into a product of matrices 

(*./*>) ~ («.,<*>) 

where (e (> (a:))'is a matrix of functions analytic at x — oo reducing to the 
unit matrix (d <y ) at x — oo, where (e tJ (x)) is a matrix of entire functions 
of determinant nowhere zero in the finite plane, and where k tr • • •, k H are 
integers. 

Let us choose the functions o tf (x) thus obtained as the functions 
a it (x) in the transformation (2). From the equations (8) and the last 
equation we obtain 

(».,) - (v*» (*«,) 

where 

(9) -«„(*)*"• 

Hence the n sets of functions (Y lJf • • •, Y nJ ) where j — 1, • • •, n form a 
set of solutions of the transformed equation (3). 

If we substitute each of these solutions in the equations (3) in suc¬ 
cession we may combine the n* resulting equations into the single matrix 
equation 

© - (F«W> (r„) 

whence we find 

(10) (*„<*» - © V'jrK 


*) For a complete discussion of the facts outlined here see Schlesinger, loc. 
cit., p. 90—104. 

**) Schlesinger, loc. cit., p. 21. 
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<,Y„) - - («„(*» (■»„*") 

and also 

(SO - (*“' Sx •«(*)+m ,*-' -1 *„(*)) - (r,{*)=f‘-') - (/;,(*)) 

where the functions f if (x) are entire functions. From (10) we have then 

- (/•„<*» {a,,*-' 1 ) (*„<*»“ 1 - V (f„W)(.e, ,<*))-. 

Hence the matrix (p tJ (x)) is made up of functions single-valued and 
analytic in the finite plane except for a possible pole of the first order 
at x — 0. 

Moreover, since the rank of the singular point x — oo of (3) is q + 1, 
the functions p tJ (x) are clearly rational in character at x —oo, and their 
expression in descending powers of x begins with a term in the g ,h power 
of x or in a lower power. 

Consequently the functions xp it (x) are analytic everywhere in the 
finite plane with a pole of at most the (q + 1)"* order at x — oo, and 
must be polynomials P it (x) of degree q + 1 at most. It follows at once 
that (3) has the form stated in the theorem. 

Thus the theorem is proved in the case where all the multipliers 
p,, •••,£„ are distinct (or more exactly, when the elementary divisors of 
the matrix — d tJ X) are distinct). 

It remains to account for the degenerate case when two or more of 
these multipliers become equal. We consider merely the simplest case 
when two values of p become equal, say p, and p,, as well as the corres¬ 
ponding elementary divisors. In this case »i linearly independent solutions 

(yin •••# y»J» •••» O/.w •••» .O 

may be found such that when x makes a positive circuit of infinity these 
become respectively 

• ■ ■, 9,y„) (p.yu+y.i.■ ■ •. Piy. : +y.i) • • • (p„y... • • •. p .’j..)- 

In this caso we define l lt • • •, X m as before and write 

y.. - *h(*)A y,» - ( + , ' ,w ) **'’ y,s ” W *)* 1 * 1 ’' •» 

thus defining a matrix (/ t> (x)) of functions which again are single-valued 
and analytic for x ^ Ii, and of determinant 

x - 2W .-...-a - j yi/ ( x )| + o for |*| i-B. 

This matrix therefore satisfies the conditions prescribed in the auxiliary 
theorem. 
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Equivalent •ingular point*. 


Now, proceeding essentially as before, we infer that the functions 

rn (x) are polynomial in x of degree at most q + 1. . 

The formulas (4) show that in all cases the coefficient of a? m 



is the same as the corresponding coefficient in the function P*/*)- 

In the simplest case of a regular singular point the canomca 

system is of the simple soluble form 

(»- 1,2, •••,*!), 




) ■ ‘ 


where the constants p tJ are the coefficients of ± in the expansion of 
p..(x) in descending powers of x. 

From this fact the fundamental existence theorem for a regular singular 
point may be at once derived. 
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Berichtigung 

7 .u Hem Aufsatze von George D. Birkhoff: „A Theorem on Matrices of Analytic 
Functions" Math. Ann. 74, S. 122—138. 

Recently I have noticed, and it has been kindly brought to my attention by 
Professor Otto Toeplitz also, that the theorem of which a second proof is given by 
me in the paper ‘A Theorem on Matrices of Analytic Functions' is essentially equi¬ 
valent to the solution of a special case of the extended Riemann problem solved 
by Hilbert and Plemelj; this second proof is not markedly different from theirs. 
Suitable statement of the facts with due references will be found in my papers ‘A 
Simple Type of Irregular Singular Point', to appear in the October (1913) number 
of the Transactions of the American Mathematical Society, and ‘On the Generalized 
Problem of Riemann for Linear Differential Equations and the Allied Problems for 
Linear Difference and ^-Difference Equations' to appear at once in the Proceedings 
of the American Academy of Arts and Sciences. 

George D. Birkhoff. 
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THE GENERALIZED RIEMANN PROBLEM FOR LINEAR 
DIFFERENTIAL EQUATIONS AND THE ALLIED 
PROBLEMS FOR LINEAR DIFFERENCE AND 
,-DIFFERENCE EQUATIONS. 

By George D. Birkhopf. 


Received June 9. 1913. 

The program of obtaining a characterization of a function in sim¬ 
ple descriptive terms which are independent of the equations of 
definition of the function is a familiar one. To Riemann is due 
the formulation of this characterization for the algebraic functions 
and for the functions defined by ordinary linear differential equations 
without irregular singular points. In both of these instances the 
characterization involves a certain number of characteristic constants 

_the monodromic group constants in the last mentioned instance. 

Riemann also proposed the associated problem of assigning these 
constants at pleasure. 1 

During the last few years I have discovered that the program 
admits of extension in a number of directions. The aim of the present 
paper is to solve the generalized problem of Riemann for ordinary 
linear differential equations with irregular singular points, and the 
analogous problem for linear difference equations and for linear <j-dif- 
ference equations. The formulation of the first and second of these 
problems has been given by me earlier. 2 At about the same time as 
myself, Norlund, in his fundamental work on linear difference equa¬ 
tions, was led to formulate essentially the second problem. 3 The 
third is stated in the present paper. 

The problem of Riemann for linear differential equations in its 


i (SS>. and 12.243-284 (1911). These 

two papers will be referred to as I and II respectively. 

3 MSmoires de l’AcadSmie Royale des Sciences et des Lettres de Danemark, 
series 7, 6, 309-326 (1911); C. R. vol. 156. pp. 200-202 (1913). 

In the second of these papers. Norlund gives a formulation and explicit 
solution of the hypergeometric difference equation problem. 
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classic form was first solved by Hilbert. 4 His treatment and Plemelj’s 
elegant completion thereof 5 reposed alike upon a certain theorem 
whose proof was made by means of the Fredholm theory. Owing 
to the deep-seated analogy between linear differential and difference 
and ^-difference equations, I have been able to apply a convenient 
extension of the same theorem in all cases; my proof is based on a 
method of successive approximations. 

Inasmuch as I have been able to simplify Hilbert’s and Plemelj’s 
treatment of the classic Hiemann problem, I have ventured to include 
my treatment of it also. 


Part 1. The Preliminary Theorem. 

§ 1. Some Definitions. 

Let C be a simply closed analytic curve in the complex x-plane. 
If the arc length along this curve from a fixed to a variable point is 
measured by s, and if / be the length of C, it is clear that x is a single¬ 
valued analytic function of s with period / for s real, and that dx/ds 
is not zero. Consequently if we introduce a new variable r defined 
by 

2» V-u 
r = e 1 , 

a one-to-one analytic correspondence is set up between the points 
of the unit circle | r | = 1 in the r-plane and the points of C. It will 
therefore be possible to choose p > 1 so that the circular ring in the 
r-plane, 

~ ^ M = p> 

is transformed in a one-to-one and conformal manner into a ring in 
the x-plane bounded by simply closed analytic curves C\ and C 2 , 
within and without C respectively, while at the same time the circle 
| r | = 1 is transformed into C. Let r = r (x) be the function which 
effects this transformation. 

Also let a (x) be any function continuous together with its deriva¬ 
tives of all orders along C , 8 and analytic save at a finite number of 


4 

6 

0 


Gott. Nachr. (1905), pp. 307-338. 

Monatsh. f. Math. u. Phys., 19. 205-246 (1908). 

By definition we take df\x)/dx along a curve L as follows: 

df(x) = um /(*') -/(x) 
dx x'-x 


( z , x on L). 
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points of C. These restric¬ 
tions on o (x) ensure that we 
can choose regular curves D x 
and D 2 , within and without 
C respectively, and osculating 
C at the points where a (x) is 
not analytic (Fig. 1) in such a 
way that on the continua lim¬ 
ited by Du D 2 , we have 


(1) ,a(x)|gK. 



a (x) — a (x) 
I *-*' 



in these continua a (x) is defined as the analytic extension of a (*) on 
C It is possible to extend further the definition of a (*) throug ou 
the ring formed by C„ C, in such wise that inequalities of the type 
(1) hold; for this purpose it is clearly sufficient to choose real and 
imaginary components that join on continuously to the like com¬ 
ponents of a (*) along D, and D„ and to make each component satisfy 
inequalities of the same nature as (1). Such a choice can always be 
made. 

§ 2. On a First Type of Integral. 


Let us turn now to consider the integral 


( 2 ) 


* f atfidt, 

2 tt V— 1 Jc t — x 


where gHx) is a function analytic within C and continuous along C, 
and p is zero or a positive integer. Following Plemelj (loc. at) we 
shall term a function g+(x) of this description a regular inner function 
and affix to it a superscript +; likewise a superscript - will indicate 
that a function is a regular outer function, 1 . e. is analytic in the 
extended plane without C, and continuous along C. 

We can demonstrate at once that the integral (2) represents a regu¬ 
lar inner function f*(x), or a regular outer function / (x), according 
as x is within or without C. In the first place these functions are 
analytic within and without C respectively, as appears from ( 2 ). 

In the second place, by Cauchy’s integral theorem we have 



1 frWM,. 1 
0 = ^nJc-T^ dl -2r^--Jc, t-x 


26 ) 
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provided that x lies between C and C 2 , since the function t p (x) g+(x) 
is analytic in the ring C, C\ and continuous along its boundary. Thus 
we may write 




+ 


«to_ f r>(Q^(0 

2ttV—1 Jcx t — X 


The first integrand on the right-hand side is continuous in x and t, for 
t on C and x in the ring C, C 2 unless x — t, when the integrand is not 
defined; in the neighborhood of a pair of values x = t, the integrand 
remains finite by (1). Hence the first integral approaches a continu¬ 
ous limit as x approaches the boundary. Inasmuch as t is restricted 
to lie on C\ in the second integrand, the same statement is certainly 
true of the second integral. Hence /“(x) may be so defined as to be 
continuous along C. 

Likewise by means of the relation 


(5) 


t*(x) g+(x) = 


2 ?c V— 1 Jc t — x 

1 r ^(0r(0 A 

2 w V^T JC\ t-x M 


valid for x between C and C\ by Cauchy’s integral formula, we obtain 
( 6 ) r to - r W to afcr) = 2 -^= f c r-(t) g' (i) dt 

2 jtV_I Jet t — x 


From this equation we can at once infer that f*(x) may be so defined 
as to be continuous along C. 

Thus f+(x) and /"(x) are respectively regular inner and outer func¬ 
tions. 

A comparison of the relations (4) and ( 6 ) which are both valid along 
C gives us the fundamental equation 

(7) r to - /- to = tP to 9* to a to along C. 

Let us now consider the maximum modulus of /“to outside of or 
along C. This maximum modulus, and likewise that for 0 + (z) within 
or along C, are attained on C of course. Suppose that we have along C 

(8) liTtol^. 
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Now modify the contour C of integration in (4) to D\. The integrand 
is analytic in / over the continua enclosed by C and D i, so that the 
value of the integral will not thereby be altered. (It must be remem¬ 
bered that x lies without C in (4).) From this modified form of (4) 
we obtain 


<9) 


/■« isg 



upon applying (8) and (1). 

But the two integrals which appear in the right hand member of 
this inequality tend to zero as the unspecified integer p increases; in 
fact we have | r (x) | < 1 within C so that r p (x) tends uniformly to 
zero in any closed continuum within C, as p becomes infinite. It is 
to be observed that the quantity 1 t — x | which appears in the second 
integrand is never less than the minimum distance from C to C\ t 
since x lies without C, and t is a point of C\. 

These considerations demonstrate that for a given positive e, 
however small, the integer p may be chosen so large that for every 
regular inner function g*(x), we have 


(10) maximum of \f~(x) | £ t {maximum of < 7 *(x)) along C. 


§ 2. On a Second Analogous Type of Integral. 
In the same way we may treat an integral 


( 2 ') 


1 _ f r-*(t)g-(t) 

2 ir V— i Jc t — x 


a (0 dt, 


where g~{x) is a regular outer function, and p is zero or a positive 
integer. As before we denote the value of the integral for x within C 
by /*(*)» and for x without C by / _ (x). A discussion parallel to the 
earlier one in § 1 shows that / + (x) and f~{x) as thus defined are 
regular inner and outer functions respectively; in this case equation 
(4) is replaced by 

<4 '> r w - IT^r! Sc ® <« i' 

I flW r T- p (t)g (t) J 
2 .VZnJca t-x dt > 
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and (6) likewise by 

(6') f-(x)+r--(x)g-(x)a(x)= —J 

2 ir v 


i 


° (j) dt 

c t — X 


+ 


to f T~ p ft)r (0 

2t V— l Jcs t — x 


From these two equations there results at once 
(7') fW-r to = T ~ p to 9~ to « to along C. 

In order to develop an inequality for the modulus of f+(x) in this 
case, we note that the contour C in (4') may be modified to D 2 . The 
modification yields 

where L is the maximum of | »7"to | along C. But | r~ p (x) | tends to 
zero for x outside of C as p becomes infinite, since for such an x we 
have | r(.r) | > 1 . We conclude therefore that for any positive t 
however small, the integer p may be taken so large that for every 
regular outer function g~(x) we have 

(10') maximum of f* to ! ^ «I maximum of ^“to II along C. 

A further property of the function /“to, which is apparent from its 
definition, is that this function vanishes at x = oo . 


§ 3. Solution of a Pair of Matrix Equations. 

Throughout the present paper we shall be concerned with linear 
equations in n unknown functions, whose complete solution may be 
expressed in terms of n particular solutions. On this account we 
shall employ the matrix notation. 

We consider first a pair of matrix equations 

F+ to — F“ (x) = t p to 0* to A (x)> 

G- to - G* to = T ~ p to F~ to -'I" 1 to - /, 

Here r(.r) is the function defined in § 1; the matrix A(x) is a given 
matrix (tf#(x)) (i,j = 1 ,.. .,n) of which each element is defined along 
C and has the properties specified in § 1 for the function a(x) (namely, 
it is continuous together with its derivatives of all orders along C, 
and analytic save at a finite number of points); furthermore the de¬ 
terminant | i-l(x) I »s not to be zero along C. The symbol I stands 



along C. 
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for the unit matrix (5 i; ) in which = 1, = 0 for i j* j, and A “ (x) 
stands for the matrix inverse to A(x). The matrices F+(x), G+(x), 
F~(x), G~(x ) are to be determined to satisfy (11), the first two as 
matrices of regular inner functions and the last two as matrices of 
regular outer functions. The matrix products on the right hand side 
are the customary matrix products, and the factors r p (x), t" p (x) stand 
for the matrices (r p (x) 6 0 ) and (r" p (x) $,,) respectively. 

It may be proved without difficulty that for p taken large enough 
a solution of these equations exists. To effect this proof we apply a 
method of successive approximations based on the sequence of 
equations 


F 0 +(x) = F 0 ”(x) - Gr(x) = 0. Go* Or) = I, 
( FS (x) - Fr (x) = t p (x) Go* (x) A (x), 

(12) ( Gr (x) - G, + (x) - r- p (x) Fo- (x) A’' (x) - /, 
( Ff (x) - Fr (x) = r p (x) G? (x) A (x), 

( ft* (*) - Gf (x) = r- p (x) Fr Or) .l'* (x) - /, 


along C, 
along C, 


The symbol 0 is used to denote a matrix of zero elements, and the 
superscripts -f- and — are used to designate matrices of regular inner 
and outer functions respectively. 

If wc write Fo”Or) = F 0 ~(x) = 0, Qo*(x) = G 0 + (x) = /, and further¬ 
more 

I Pn * M = Fm * (X) " (x) “ F ”-'~ <*>' 

( ; / Q«+ (x) = Grn* (x) - Gm- 1 + (x), Q m - (x) = G n ~ (x) - G m -C (x), 
it is clear that the sequence of equations (12) is equivalent to 

( l Q«(x)-Q m '(x) = r-(x) ZV.r(x) ,l*'(x), (m = 1,2... .). 

Here the superscripts are employed as before. 

The form of equations (14) is such that we can determine P m + (x), 
F m -(x), Q m +(x), Q m ~M in terms of F„_r(x), Q n . x + (x) so that the 
with pair of equations (14) is satisfied. In fact the first one of the 
with pair of matrix equations may be broken up into wV ordinary 
equations 


n 

-- Pij.m-(x ) = t p (x) Y. qi.Km-i+Maxj (x) along C, 

X « 1 

(i,j = 1 , . . .,n). 
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where the third subscript on the functions corresponds to the subscript 
on the matrix. We have already obtained a solution of an equation 
of the form 

(x) — /" (x) = * p (*) QiXm-i* (x) a X j (x) along C 

(compare with (7)) in the form of a definite integral. By forming 
the sum of /+(x) and /-(x) for X = 1,. .,n we obtain for every i and j, 
elements (x) and CO which form the elements of P m + (x) 

and P m - % (x) with the desired property (14). 

Likewise we can break up the second one of the with pair of matrix 
equations (14) into n* equations. A solution may here be built up in a 
similar way (compare (7')- It must be observed that since the de¬ 
terminant of yJ(x) does not vanish along C, the elements of A (x) 
satisfy the conditions imposed on a(x) at the outset. 

Now if we recall the method of solution of (14), it is clear from 
(10), (10') that along C the maximum modulus of any element of 
P m ~ (x) or Q m * (x) does not exceed ruL*.j where L„_i denotes the 
maximum modulus of any element of J\,-f (x) or Q m .i+(x) along C, 
and t is arbitrarily small uniformly for all values of m. This relation 
may be expressed in the simpler form 

(15) L m ^ ruLm-i • 

The series formed by the elements in any ith row and jth column of the 
series of matrices 

Pc - Or) + Pr (x) + . . ., Q 0 + (x) + <?i* (x) + . . 

will therefore converge absolutely and uniformly provided that t is 
taken so small that nc < 1 . 

But the sums of m + 1 terms of these two series of matrices are 
F m -(x) and G n + (x) respectively, whose elements therefore converge 
uniformly to the elements of matrices F~(x) and G ,+ (x) of regular outer 
and inner functions respectively. If we recall that P 0 ”(z) = 0, and 
that the integral form of representation of each element of P m ~ (x) 
(see ( 2 )) makes each element of this matrix reduce to zero atx= oo , 
it is plain that at infinity F~(x) reduces to the matrix 0. 

If we turn now to consider (x) and (x) along C, w'e see from 
what precedes and from equations ( 12 ), that these matrices also 
converge uniformly along C, and therefore respectively within C and 
without C, to matrices F + (x) and G~(x) of regular outer and inner 
functions. Since G' 0 ~(x) = 0, it is clear that G"(x), as well as F"(x), 
reduces to 0 at x = go . 
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The matrices F-(x), F+(x), 6 '-(x), C*(x) thus obtained will satisfy 
( 11 ), as appears from ( 12 ) by letting n become infinite. 


§ 4. Application to the Solution of a Single Matrix Equation. 

Multiply the second matrix equation (11) on the right by 
r p (x)A(x) and subtract it, member for member, from the first equation 
(11). There results 

(16) F+ Or) - t p Or) [/ + G~ (x)] A (x) along C. 

Inasmuch as G~(x) reduces to a matrix of zero elements at x = 00 , 
the determinant of / + G~(x) and also of F+(x) is not identically zero. 

The matrix equation (16) admits of further simplification. In fact 
the function log r(x) is analytic along C and increases by 2ir V— 1 
as x makes a positive circuit of C. If c lies within C the function 

<t> Or) - log r (;r) — log (x-c) 

is accordingly single-valued as well as analytic along C. But <j>(x) is 
of the form 

t p (x) 0 + (x) a (x) (p = 0, g(x) = 1, a(x ) = <t> (x)), 


so that by (7) we can find 0 * (x) and 0“ (x) such that 
0* (x) — 0- (x) = <t> (x) along C\ 

this gives us 


r p (x) = (x-cY e p '-(x). 


Now let us write 

4> (x) = e-rm F+ (x), ^(x) = (x-c) p e p9 '^> [I + G~ (x)]. 


By these equations we define 4>(x) as a matrix of regular inner func¬ 
tions, and ^(x) as a matrix of functions analytic outside of C except 
for a pole of order p at x = co , and continuous along C; the determi¬ 
nant of neither ^>(x) nor Sk(x) vanishes identically. Between 4»(x) and 
^(x), by (16), we have the matrix relation 


(17) 


4> (x) = ^ (x) A (x) along C. 


It is this type of matrix equation which is important for the present 
paper. 
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§ 5. Further Properties of 4»(x) and ¥(*). 

It is necessary for us to investigate further the nature of any solu¬ 
tion <t> (.r), Sis' (x) of (17) in the neighborhood of the curve C. In the 
first place, it is to be observed that at points of C where all of the 
elements of A(x) are analytic, the same is true of the elements of <l>(x) 
and 'I'(x); in truth, the equation (17) shows us that analytic extension 
is possible across the curve at such points. 

We shall prove that the elements of these matrices possess derivatives 
of all orders, continuous at all points of C. 

Since the elements of ^l(.r) have line derivatives of all orders along 
C we may write, for t and y upon C, 

(18) A (t) - A (y) + (t — y) ~ A (?/)+.. . 

+ - ^ y)k (y) + «-v) k *«>v)> 


where the elements of B(t , y) are continuous functions of t and y 
along CP Also by Cauchy’s integral formula in matrix form we 
have from (17) for .r within C 


(x) 

dx*-' 


(- !)*-» (4. -1)! r *0) A(Q , 
2 7T V— l Jc(t—x) k 


(A- 



If we substitute the above expression for A(f) in this last equation, we 
obtain a number of terms of the form (save for a constant multiplier) 


(19) 




The integral is not altered in value if C is replaced by C 2 which lies 
outside of C. Therefore each of these terms represents a function 
analytic in x and continuous in y along C. There remains a single 
term not of the form (19), namely 


(_!)*-« 0fc - 


2ir V 




7 When a differentiation or integration sign appears before a matrix, it is 
understood to apply to each separate element of the matrix. The stated 
property of B{t, y) comes at once from the explicit formula 


_ % 1 r i /z-v\ k d k +'A(z) 
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If now x be made to approach a point x 0 of C, and if y be taken as the 
foot of the normal from x to C, this term approaches a limit which is 
continuous along C. In fact the factor (t — y) k /(t - x) fc remains finite 
for t along C, and approaches the limit 1 uniformly save in the vicinity 
of x 0 . Thus if x (and y also of course) approaches x 0 , d*“ l 4> (x)/dx* -1 
approaches a limit along C continuous for an arbitrary x 0 . 

A similar proof shows that 'F(x) has derivatives of all orders, con¬ 
tinuous outside of and along C. This proof is based on the fact that 
the elements of A~ l (x) satisfy the restrictions placed on the function 
a(x) in § 1- 

The above results also lead to the conclusion that at any point 7 of C 
at which one or more elements of A(x) fails to be analytic, the ele¬ 
ments of <!>(*) and 'F (x) admit of asymptotic expansion in a series in 
positive integral powers of x — 7 . This is an immediate consequence 
of an expansion like (18) for 4>(x) or SP(x) in which now t and y can 
be any points within or without C respectively, and D(t, y) is contin¬ 
uous in t and y.® 


§6. A Normal Form for <l>(x) and 'I'(x). 

By a series of simple normalizations it is always possible to obtain 
a solution 4>(x), 'I'(x) of (17) such that $( 2 ) is (as before) a matrix of 
regular inner functions, and 'F(x) is a matrix of function analytic, 
without C in the extended plane except for a possible pole at x = c©, 
and furthermore such that \ <t>(x) | does not vanish within or on C, and 
| >F(x) | does not vanish without C. 

This solution may be directly obtained from that found in § 4. 
If | 4>(x) | vanishes at x = c within C say, we can determine a matrix M 
of constants such that all the elements of the first row of M 4>(x) 
vanish at x = c, while | M | 5 ^ 0. Now M<t> (x), A/'k (x) yield a new 
solution of (17), which has the properties given for 4>(x), 'I'(x) in § § 4, 5. 
If we divide the elements of the first row of M 4>(x) by x - c, we obtain 
a matrix 4>(x) of functions analytic within C and continuous along C; 
if the same operation be applied to 't'(x), we obtain S^(x), a matrix 
of functions analytic without C in the extended plane save for a pos¬ 
sible pole at x = 00 . Moreover 4>(x) and ¥(x) yield a solution of (17), 
since the effect of this operation is to alter the matrix on either side 
only in the removal of a factor x — c from the first row. By this 
device we have diminished the multiplicity of the zero of | 4>(x) | at 

8 For the relation between the existence of derivatives and of asymptotic 
series see W. B. Ford, Bull. Soc. Math. France, 39, 347-352 (1911). 
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,r = c by one unit, without the introduction of further zeros of | <!>(.»•) | 
or | 'l'(.r) | in the finite plane. 

An entirely similar process eliminates a zero of | 'J'(j’) | without C, 
or a zero of | <f>(.r) | and |'l'(.r) | along C. In consequence of the 
results of §5, if either of these functions vanishes along C, the other 
does also, both at least to the first order. 

This process may be continued so long as there remain zeros of 
<I>(.r) or 'l'(.r). It must however finally come to an end. If this were 
not the fact it would follow at once that both ] 4>(.r) | and | 4'(.r) | have 
a zero of infinite multiplicity at some one point of C where an element 
of d(.r) fails to be analytic. But this cannot be the case, for let <!>(.r) 
and i(x) be a solution of the following matrix equation 

(20) i Or) - -!-(*) * M along C, 

where the elements of <t>(x) and 'l'(.r) are restricted like ‘K.r) and Sk(.r) 
were found to be in § 4. The existence of such a solution becomes 
manifest by a mere interchange of the role of rows and columns in 
what precedes. Now from (17) and (20) we conclude that 

<t> Or) • i Or) = * Or) ! •. * Or) along C. 

The function represented by either side of this equality is not identi¬ 
cally zero; and it appears from the two representations that it is 
analytic in the finite plane, and analytic or with a pole at infinity. 
Hence this function is a polynomial, and the multiplicity of the zeros 
of either | <I>(.r) | or | *0r) | at any point of C is finite. 

When the process comes to an end the following result has been 
obtained: if the elements of .1 (x) are unlimitedly differentiable along 
C, analytic save at a finite number of points of C, and if |/l(.r)| is 
not zero along C, there exists a solution «!>(.r), *(*) <>f the equation (17) 

<P (x) = ¥ (x) A ( x ) along C, 

in which the elements of 4>(x) are analytic within C, unlimitedly differ¬ 
entiable along C, and 4 >(jt) is of determinant not zero within or on C; 
and in which the elements of ¥(*) are analytic without C in the ex¬ 
tended plane save for a possible pole at x = c© , unlimitedly differen¬ 
tiable along C, and *(*) is of determinant not zero without C. 

Here the point x = <* appears as an exceptional point. It is evi¬ 
dent that an arbitrary point x — a not on C may be used to take the 
role of the point at infinity. In fact a may also be taken to be a point 
of C. When this is the case, the elements of <£(•*•) and ^(x) are finite, 
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or become infinite to finite order at .r = a. To obtain such a solution 
4>(.r), 'P(.r) it is only necessary to divide each element of $(x) f 'PM 
bv a suitable power of .r — a, so chosen as to make each element of 
these matrices analytic at x = co ; and then to apply the normaliza¬ 
tion above indicated, letting l/x — a replace x. The curve C can also 
be taken to be a simply closed analytic curve which passes through 
*= oo. 

The main part of the conclusion that has been deduced above was 
obtained by Hilbert and Plemelj (loc. cit.) with the aid of the Fredholm 
theory of linear integral equations. Independently of their work, I 
treated a special case (see I, § 1) which arose in a different form in 
connection with my study of the irregular singular points of ordinary 
linear differential equations. 

My proof in this special case suggested to me the above treatment 
of the general case by the method of successive approximations. 
The restrictions here placed on the elements of A (.r) and on the curve 
C are not essential to this treatment, and I have very little doubt that 
these may be replaced by the weaker restrictions of Hilbert and Plemelj. 
Nevertheless I have been content to use the simplest restrictions 
consistent with the applications in view. 

A second proof in the special case has recently been given by me, 
Math. Ann., vol. 74 (1913) pp. 122 (see also Bull. Am. Math. Soc., 
vol. 18, 1911, p. 04). This second proof, which suggested itself to me 
at about the same time as the first, is practically the same as that given 
by Hilbert and Plemelj. To my considerable regret this relation¬ 
ship escaped my observation until it was too late for me to make 
suitable reference. 


§ 7. The Preliminary Theorem. 

The following is an extension of the preceding results which is 
convenient for the applications: 

Preliminary Theorem. Let C u ..., C r be r simply closed analytic 
curves in the extended complex plane. I^t -4i(x),. . ., A r (x) be matrices 
of functions defined and unlimitedly differentiable along C i,...» C r 
respectively, analytic save at a finite number of points of these curves and 
of determinant not zero. If furthermore at any point of intersection of 
C a , Cs, the matrices A a (x), A#(x) arc such that the formal derivatives of 
all orders of the matrix 

(21) A a (x) A 0 (x) — Afi (.r) .4 A (x) 
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vanish, there exists a matrix 4>(x) with the following properties: 

(1) each element of 4>( x) is analytic except along C\,.. C r and at an 
arbitrary point x = a where the elements may become infinite to finite 
order; | <t>(x) | nowhere vanishes save ]>ossibly at x = a; 

(2) the elements of 4>(x) are continuous and unlimitedly differentiable 
along each curve C t from either side, analytic from either side save at 
points of intersection of the curves, or at those points where an element of 
Ai(x) fails to be analytic, or at x = a; if a lies on a curve C„ the matrix 
( x - a) 1 ^<(x) [or aT l A { (x) if a = co ] is unlimitedly differentiable along 
Cifor a suitable l. 9 

(3) if a + and — side of each curve C< are chosen, then 

* (x) = [,2?,- * (*)1 Ai (X f ) (t = 1.r), 

where the approach to the arbitrary point x< of C t is along the + and — 
side respectively . 

Let us begin by establishing the theorem in the case when a is 
not a point of C\,..C r . It has already been established for r = 1 
(see § 6), with the single notational difference that <I>(x) was replaced 
by either of two matrices 4>(x) and * (x), according as x was within or 
without C. To establish the theorem then, we need only show that 
if it is true for r^k it is also true for r — A: + 1, when the theorem 
follows by induction. 

Assume that (x) is the solution for r = k, and for the matrices 
Cl,..., c k , A\(x),..Ak (*), where C lt ..and A x (x),.. A M (x) 
satisfy the requirements of the theorem for r = k + 1. Let us sup¬ 
pose for the moment that a solution 4>ui (*) exists with the desired 
properties. If we write 

(22) i (x) - U (x) (x), 

the following facts are clear from (1), (2), (3) of the theorem: (!') 
each element U (x) is analytic except along C\,. .., C*+i.and at the 
specified point a, where its elements may become infinite to finite 
order; | U (x) | nowhere vanishes save possibly at x — a; (2') the 
elements of U(x) are continuous and unlimitedly differentiable along 
each curve C, from either side, analytic save at points of intersection 


9 A function will be termed unlimitedly differentiable at x = oo if when we 
write x = \/x’, the function of x' obtained by the substitution is unlimitedly 
differentiable at x' = 0. 
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of the curves or at those points where an element of A t (x) fails to be 
analytic; 

U (i) = U (x) (t = 1,.. k), 

(3,) .*!£,♦ CT (*)♦*(*) = [*,!£,- U &*“ (*)1 (**)• 

The condition (3') necessitates that U (x) is analytic at any point 
Q f Ci(i= 1 ,. . k) and throughout the extended plane save at a and 

at points of Cm- The condition (3') gives us in addition 

u{x) = **-'(**♦»). 

Conversely if U(x) satisfies these conditions (l')> (2'), (30 it is ap¬ 
parent that4> A+ i (x), given by (22), will satisfy the conditions prescribed 
for 4>(x) in the theorem. 

But in view of the above simplification of (30» the conditions (l')» 
(20, (30 on U( x ) are precisely those of the theorem on 4>(x) if we take 
r=l, C - Cm, .Ai(x) - 4>*(i)i4* f i(z)4>r l (i). To complete a 
proof we need only show that this matrix fulfils the conditions pre¬ 
scribed in the theorem along Cm* for then a matrix U(x) will exist 
which satisfies these conditions. 

The elements of the matrix 

(23) (x) Am (x) «f»r l (x) 

are analytic at points of C*+i which are not points of intersection or 
contact with C\ t . . .,C*, or points at which an element of ^4* + i(x) fails 
to be analytic; at these latter points the elements of 4>*(x) are analytic 
so that the above matrix is unlimitedly differentiable in the neighbor¬ 
hood of such a point. The determinant of this matrix is equal to 
| Ak+ i(x) | and nowhere vanishes along Cm- 
It remains only to examine the elements of the matrix near points 
of intersection or contact of Cm with one of the curves C \ t .. - ,C*. 
Suppose first that intersects a single curve C, at such a point. As 
x moves along the curve and passes from the positive to the 
negative side of C it the matrix 4>* (x) changes to <l>* (x) A { (x), so that 
the matrix (23) changes to 

$>* (x) At (x) Am (*) Ac 1 (x) S*" 1 (x). 

Bearing in mind the condition of permutability imposed on 
A\ (x),.. ., A, (x) at such a point of intersection (see (21)), it becomes 
apparent that the elements of the matrix (23) are continuous at this 
point, and have equal backward and forward derivatives of all orders 
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at the point. Thus the elements of (23) are unlimitedly differentiable 
in the neighborhood of such a point along C**i. The same is true at 
more complicated points of intersection or points of contact, as a 
similar argument shows. 

The theorem is now demonstrated, at least for the case when a 

does not lie on a curve C u _ C t . If a does lie on such a curve, we 

first choose an a which is not on these curves to replace a, and then 
resort to the simple device used at the close of § 6 to replace a' by a. 

Part II: The Problem of IIiemann and its Generalization. 

§ 8. On Cauchy Matrices. 

Let T be a matrix of constants of determinant not zero, correspond¬ 
ing to the coefficients of a linear transformation. According to the 
well-known theory of classification of such matrices, based on the 
Cayley-Weierstrass elementary divisor theory, we may write 

T = C-'I'C, 

where C is a matrix of constants, and in general V is of the form (p>5,^). 

Consider along with the matrix /', the matrix of functions 
/'(x) - (x%) where 2irk^—\ = logp, (; = 1,..., n). When x 
makes a positive circuit of x = 0, /'(x) becomes /'(x) V. The matrix 

T(x) = /'(x)C 

will accordingly alter to 

/'(*) VC = I(x)C C-'I'C - T(x) T , 

when x makes such a circuit, i.e. T(x) will be affected by the prescribed 
linear transformation. 

Such a matrix T(x ) or a simple modification thereof exists for every 
transformation T and is called a Cauchy matrix. 10 A certain measure 
of arbitrariness enters into the determination of T’(x) when T is given; 
in general k\, .. .,k n are undetermined up to an additive integer. 
We note that the determinant of T(x) is not zero for x ^ 0, oo. 

A final property of the Cauchy matrices which is important for us 
is that they form the matrix solution of a linear differential system 

i ^M = L r ( x ) . 


10 Cf. Schlesinger, Vorlesungen fiber lineare Differentialgleichungen, pp. 
122-140, where a complete treatment is given. 
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where I is a matrix of constants. In fact we have 

dr Or) _ dl Or) c 
1 dx~ ~ x ~*r 

But the matrix .r dl'(x) / dx may be written as 

KV Or), K = (Mo)- 

Thus we obtain 

* = KI' (x) C = KC- C-'r (x) C = LT (x). 


§ 9. The Monodromic Group Problem. 

The most elementary existence theorems for ordinary linear differ¬ 
ential equations show that the linear differential system 



= R(x) T(x) 


admits a matrix solution Y(x) whose elements are analytic in the 
finite plane at every ordinary point where the elements of K(x) are 
analytic, and analytic at infinity if the elements of R(x) vanish to at 
least the second order at the ordinary point infinity; furthermore 
j y(x) I is not zero at such points. All other points of the plane are 
called singular points in contradistinction to the ordinary points. A 
finite singular point at which the elements of R(x) are analytic or 
have a pole of the first order, or the point x = co if each element of 
R(x) vanishes to the first but not always to the second order at 
that point, is termed a regular singular point. 

Suppose now that the elements of R(x) are rational, and that all of 
the singular points a„..., a* are regular. These will be taken to lie 
in the finite plane. It is easy to show that the elements of /(*) be¬ 
come infinite to only a finite order at x = ai,.. . ,a m . 11 W hen x makes a 
positive circuit of one of these points, T(x) changes to y(x)7\, where 
T t is a matrix of constants and \T { \^ 0; in fact the most general 
solution is of the form Y(x)T where T is an arbitrary matrix of con¬ 
stants for which | T | ^ 0, and after the circuit is made in the x-plane, 
Y(x) is still a solution of (24). 

If we start from a point x = c and make a circuit of ai,. - ,(*m in 


11 Cf. Trans. Am. Math. Soc., 11, 199-202 (1910). 
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such wise that the combined circuit is reducible to a point, it is clear 
that 

(25) . T, = /, 

a necessary relation between the matrices T\ t . .., T m . 

The problem of Riemann is the following: For assigned points 
Oi- »a m and assigned matrices 7*i,.. T m for which (25) holds, to 
construct a matrix F(ar) of functions of determinant not identically 
zero, with elements analytic save at a lf . . .,a*, where these elements 
are finite or become infinite to finite order, and undergoing a transfor¬ 
mation to Y(x) Ti as x makes a positive circuit of a<(i «= 1,. .., m). 

A SQlution of this problem has 
been given by Hilbert and com¬ 
pleted by Plemelj (loc. cit.). It is 
possible to obtain a more simple 
solution on the basis of the pre¬ 
liminary theorem. 

Let us surround <Z| t ...,a n 
by small non-overlapping simply 
closed analytic curves C i, ..., C m 
and let us pass through a u ... ,a„ in 
cyclical order another closed ana¬ 
lytic curve D which meets each 
curve C { only twice (Fig. 2), in and m, say. 

Choose matrices A\.. .,A mt A m +\= A\ of constants such that 

(26) Aui~'Ai = Ti (t = 1,. . . ,m). 

Here Ai, for example, may be taken at pleasure and A if . . .,A m are 
then determined. Define a matrix A(x) along D to be equal to A { 
on that part of the curve which lies between and Ci+l l^m+l = Cl) 
and equal to 

(27) (t=l . m) 

for x on the part of D within C t . We will suppose that /<, m* and D 
were so chosen that D does not pass through one of the finite number 
of point for which the determinant of any matrix (27) vanishes. 

The matrix A(x) as here defined is continuous along D, analytic 
save at the points where D intersects C\, ..., C m , and of determinant 
not zero along D. 
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Bv slightly modifying each element of A(x) along a small segment 
near either end of the arc of D within C< for i = 1,.. -, m, it is possible 
to obtain a matrix A(x) whose elements are analytic save at the en 
points of these segments, and unlimitedly differentiable along U . 
Such a matrix A(x) satisfies all of the conditions necessary for the 
application of the preliminary theorem in the case r = 1. 

Taking D as the curve and A(x ) as the matrix, we can affirm the 
existence of a certain matrix 4>(x), possessing in particular the property 

(28) ,»:♦*(*)- [-2?-*(*)]-4 w. 

the approach to the point xi of D being from within and without D 

respectively. Let us choose a = a m . 

If we extend 4> (x) analytically across D between C { and G,+i, 
it becomes <t>(x)A { by (28). If we extend this matrix analytically 
back across D between C i+ i and C i+2 (C m * 2 = C 2 ], it becomes 
PKsMf+r 1 ] A r l > or *(*) T * That is ’ l . he ma . trix U M obtained 
from 4>(x) by analytic extension is analytic outside of C u . ..,C m and 
undergoes a transformation to U(x)T { when x makes a positive cir¬ 
cuit of a { . Furthermore the determinant of U (x) is not zero outside 
of Ci C m 

Denote by Z<(x— a t ) the Cauchy matrix belonging to the trans¬ 
formation T it so that Zi (x — a { ) undergoes a transformation to 
Z ( (x - a { ) T t as x makes a positive circuit of a,. Write 

(29) Y (*) - Z (x) U (x), 

where Y(x ) is the solution of the Riemann problem to be constructed. 

The elements of Z(x) must in the first place be single-valued and 
analytic outside of C h . . - ,C B , since U(x) undergoes the same trans¬ 
formation as that prescribed for Y(x) about the points a u .. .,a mt and 
I U(x) |^0. 

Furthermore within C„ the elements of Y(x ) are to be analytic ex¬ 
cept at a, where they may become infinite to finite order. Hence the 
elements of the matrix Y(x)Zr l (z—a t ) must be single-valued and 
analytic within C„ by the definition of Z,(x—a,), except for a possi¬ 
ble pole at x = a,. Along C, this matrix may be written 

(30) Zi (x) = Z (x) [C(x) Zr l (x — di)) (i = 1 , 2, ... m). 

If we write 4>(x) = Z(x) outside of Ci,.. .,C m and also 4>(x) = Z,(x) 
within C for i = 1 ,.. .,m, the equations (30) may be written 
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(31) x "%+ <*> (x) = [,!!“- *t> (x)] Ai c Xi ), Ai (x) = U (x) Zr l (x-Oi). 

(i = 1. • • rn). 

This suggests another application of the preliminary theorem, since 
the curves C\,...,C m and the matrices Ai(x),..., A n (x) of known 
functions satisfy the necessary restrictions. 

_ Let 4>(x) be the solution given by the theorem for a = a m , and let 
Z i(x),. . .,Z m (x), Z(x), be defined as equal to 4>(x) within C i,. . C m 
and outside of these curves respectively. These functions will then 
satisfy (30). Let )'(x) be defined by (29). This matrix is clearly 
composed of elements analytic without and along CC m , as are 
those of U (x); within C„ Y(x) continues analytically into 
Z { (x) Z,(x — a,) by (30) for i = 1 and consequently its ele¬ 

ments are analytic throughout the plane except possibly at ai,.. .,a m ; 
its elements become infinite only to a finite order at a, since the 
elements of Z^x - a,) become infinite only to finite order at o f . 
Furthermore by (29) }'(•*■) undergoes a linear transformation to 
Y (x) T t as x makes a positive circuit of a i . Thus the lliemann prob¬ 
lem has been solved. 

It is worthy of note that | Y (x) | does not vanish for .t ^ a { 
(i_= 1, ...» m) 1 _ This is an immediate consequence of the fact that 
I 2,(x) | Z m (x) |, | Z(x) | do not vanish in their regions of defini¬ 

tions save at these points, and of the fact that the Cauchy matrix 
Z t (x — o,) has a determinant which does not vanish save possibly at 
x = a, and x = co. 

§ 10. A Generalization. Equivalence. 

A more general result can be deduced exactly as the results of § 9 
were obtained. Let us say that two matrices of functions Y i (x) and 
F 2 (x) whose elements are analytic in the vicinity of x = a, but not 
in general single-valued or analytic at x = a, are properly equivalent 
at x = a if we have 

Vi(x) = A(x)Y 2 (x). 

where A(x) is composed of elements single-valued and analytic at x=a, 
of determinant not zero there; if this condition is not satisfied, 
but if the elements of -4(x) have a pole or are analytic at x = a, let 
us say that Y i(x) and F 2 W are improperly equivalent at x = a. 

This definition is convenient for the statement of the following 
result: Let a lt ..., a„ be m given points; let T\ t ..., T m be matrices of con¬ 
stants such that T m T m - 1 .. .T\ = /; let Z\ (x), ..., Z m (x) be matrices of 
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functions analytic of determinant not zero in the vicinity of a, and under¬ 
going a transformation to Zi(x) T u .. .,Z m (x) T n as x males a positive 
circuit of a\ t . .., a» respectively. There exists then a matrix Y(x) of 
functions of determinant not zero for x j* a u ..., a m and analytic 
save at these points, which undergoes a transformation to Y ( x) T { as x 
makes a positive circuit of a, (£ = 1,.. .m); furthermore Y(x) is prop¬ 
erly equivalent to Z x (x),. . .,Z B _,(x) at a,,..a*,-! and properly or im¬ 
properly equivalent to Z n (x) at a m . 

The result above stated is obtained when we replace the Cauchy 
matrices Z ,(x— a,),.... Z n (x— a„) of § 9 by matrices Z,(x),.... Z m (x) 
having the properties specified. The line of attack is identical with 
that given in § 9. The facts concerning equivalence follow at once 
from the relations analogqus to (30): 

(32) y (*) ■ Z i (*) Z i (*) (t = 1,.. m). 

Here Zfx) is composed of elements analytic at x = a* (i = 1,.. . ,m—1), 
and of determinant not zero there; also Z m (x) is composed of elements 
analytic at x = a m or with a pole at that point. 

§ 11. Final Form of Solution. 

There is a certain lack of symmetry between the role of a,,.. ., a m 
in the solution of the Riemann problem obtained in § 9, provided the 
given Cauchy matrices Z t (x - a,),..., Z m (:r - a m ) were taken in the 
most general possible form. We shall now proceed to show that if 
(j; __ a m ) be a properly chosen Cauchy matrix associated with the 
transformation T m , the equivalence of Y (x) and Z n (.r) can be made 
proper at x = a m also. 

A form of reduction that is well-known suffices to establish this 
fact.I 2 Nevertheless, inasmuch as a similar type of reduction is neces¬ 
sary later in the present paper, I give this reduction herewith. 

Let us begin with the matrix )'(x), obtained in § 9, which we may 
assume to be improperly equivalent at a„ to the Cauchy matrix 
Z m (x — a m ). Now we have (§ S) 

Z m (x-0 = /'(x-aJC 

so that from (30) 

Y (x) C-i = Z m (x) I'(x - a J, 


12 Plemelj, loc. cit., pp. 237-210. 
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The matrix of elements on the right can he written in the form 
(x— a m ) k ' (fln+ bi\(x — a m ) + ...).•••. (*—a.)*«(ai. + 6, n (x-a m ) + ...) 

.. 

(x—an)*'(anl + 6nl(j:— dm) + • • •)*• • •» ( X — am) kn ((lnn+h nn (x a «) + • • • ) 

Here it is supposed that the exponents of the Cauchy matrix do not 
differ by integers; but a similar form can be found in all cases. In 
each column of Y(x) the highest possible power of x — a n is exhibited 
which leaves the coefficients of this power analytic in character at 

If | | 0 we can write this matrix in the form /!(*)/' (x — a n ), 

where /' (x) is the matrix (x*>5„), and where A (x) is the matrix ob¬ 
tained from (33) by striking out the exhibited powers of x — a. From 
the equation 

F(x) = A (*)/'(*-a.) C- 1 

we see that Y(x) is properly equivalent to a Cauchy matrix belonging 
to T at x = a*. 

On the other hand if |<fy| = 0 we proceed as follows: It is readily 
verified that if )'(x) be one matrix solution of the Riemann prob¬ 
lem satisfying the relation of equivalence given in § 10, then 
DY (x) (| D | 5 * 0] is also a solution. Consequently it is no restric¬ 
tion in the consideration of (33) to assume that 

an “ flu - •. = a u = 0* 

since when | | = 0 this relation may always be made to hold 

by multiplying on the left by a suitable matrix D. Denote by 
yi(x),...,y n (•*•) the elements of the first row after a factor (x — a m ) 1 
has been removed, where l is the exponent of the highest power of 

x _that may be taken out and leave the elements y x (x),..., y n (x) 

of the respective forms 

( x -a m ) k ' (fx + Oi (x-a.)+ a.)*«(/. + ^(*—«•) + ...) 

It follows that are not all zero. 

We inay now add constant multiples of y x (x),.. ., y n (x) to the succes¬ 
sive rows of l*(x) and obtain a new matrix Y{x). In fact this is equiva¬ 
lent to multiplying Y(x) on the left by a matrix 
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0 

0 


1 , 0 , 

C2 


PM = ’ 


(* - Clm) 


1 , 


Cn 


I (a- — a m ) 


b 0 , . 


.Inasmuch as | PM | = 1, and the elements of P(x) are analytic save 
at x = « m , this modification cannot affect any of the properties already 
secured for Y(x). However by choosing c 2 ,.. ., c m properly we can 
clearly make all the coefficients a*,... a** vanish at x = a m pro¬ 
vided U t* 0. In this manner we can increase an exponent k t by 1 
without altering the determinant of Y(x) except by a constant factor. 

Inasmuch as | Y(x) | does not vanish identically, a succession of 
steps of this type will finally bring to light a solution Y ( x ) for which 
y(j) C has the form (33) and in addition | | ^ 0. When this 

stage is reached F(x) will be properly equivalent to a Cauchy matrix 
belonging to T n at the point a„. ... 

When Y(x) has thus been given a normal form, it is the solution 
of a linear differential system (24) with regular singular points at 
x = a u .. .,a n and having no other singular points, as may be at once 
proved. 13 The elements of R(x) therefore have the form of rational 
functions whose numerators are polynomials in xof degreeatmost mi — 2, 
and whose denominators are the product of ( x — ai), .. ., (* — a m ). 


§ 12. Irregular Singular Points and Canonical Systems. 

The Cauchy matrix is the simplest possible matrix of functions to 
which a matrix solution of a given linear differential system is properly 
equivalent at a regular singular point. Let us determine the simplest 
possible matrix Z(x) to which the matrix solution Y(x) of a given linear 
differential system (24), in which the elements of R(x) need not be 
rational, is properly equivalent at a prescribed irregular singular point. 
It is convenient to take this point to lie at infinity. If the highest 
order of any pole of an element of xR(x) at x = oo is p + 1 (p = 0), 
then p-\- 1 is said to be the rank of the singular point .c = «>. 

According to the results of § 10 we can find a matrix Z (.r) which at 
x = od is properly equivalent to )’ (x) and at another point x = 0 

13 Schlesinger, loc. cit. pp. 215-221. 
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is improperly equivalent to a Cauchy matrix which at x = 0 undergoes 
a transformation inverse to that which }’ (x) undergoes at .r = oo. 
Here wc take m = 2, a i = 0, «e = 00 . The condition l\T\ = I is 
satisfied. 

By means of a modification precisely like that of § 11 wc can make 
Z(x) properly equivalent to a suitable Cauchy matrix at x = 0, and 
yet preserve the other properties listed in § 10. 

Now consider 


AW 


dZM 

dx 




Since | Z | (x) | ^ 0 for x ^ 0, co, and since Z(x) and dZ(x)/dx 
undergo the same substitution about .r = 0, the elements of ll\(x) arc 
single-valued, and analytic for x y* 0, co. Since Z(x) is properly 
equivalent to a Cauchy matrix at .r = 0, the elements of H\(x) have 
poles of at most the first order at x = 0. 14 Moreover since Z(x) is 
properly equivalent to Y(x) at x = wc have 

Z(x) = A(x) Y(x), 

where the elements of J(x) arc analytic at x = °o and also | .l(.r) | y* 0 
at x = oo . Therefore wc obtain 


= A (x) " Y (.r) = [.1 W R W + g] Y <*>. 

and 

Rl {x) = dZ dT z ~' W " [ A w <*> + (*>• 

Hence the elements of Jii(x) are analytic or have a pole of order not 
greater than p at x — oo. 

From this analysis it follows that xR\(x) is a matrix of polynomials 
of degree at most p + 1, so that Z(x) is itself the solution of a linear 
differential system 

(34) x d £=P(x)Z, 

where P(x) is a matrix of polynomials of degree at most p -f 1 • This 
is the canonical form of equation with an irregular singular point of 
rank p + 1 at x = go . 

At a finite singular point x = a, the canonical system is of a type 
obtained from (34) by a transformation x' — a = l/x. 

14 The detailed proof is entirely similar to that given herewith to determine 
the nature of the elements of 7? (x) at * * oo. Cf. Schlcsinger, loc. cit., 
pp.143-144. 
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In order not to introduce artificial difficulties we shall consider the 
regular singular point to be of rank zero. To this case the above 
argument applies also, and the canonical system is that satisfied by a 
Cauchy matrix. 

If Y(x) is the matrix solution of a differential system (24) in which 
the elements of R(x ) are rational, the above argument shows that at 
each of its singular points a u ...» n m , Y(x) is properly equivalent to 
the matrix solutions Z x (x) t . .., Z m (x - a m ) of canonical differential 
systems. 

Conversely the results of § 10 lead to the conclusion that given 
T u . . T m such that T m T m - X . .. T x = /, and canonical differential sys¬ 
tems belonging to the singular points a x ,..., a m with matrix solutions 
Z x (z),..., Z m (.r) undergoing a transformation to Z x (x)T\, • • • > ZmW T m 
at these points respectively, there will exist a matrix solution Y(x) 
of a rational differential system (24) which undergoes a transformation 
to Y(x) Ti as .r makes a positive circuit of a, and which is properly 
equivalent to Z { (x) at this point, for t - 1,..., m — 1, and properly or 
improperly equivalent to Z m ( x) at x=a m . 

It is therefore essential to obtain a characterization of the matrix 
solution of a canonical system (34), and further to solve the associated 
inverse problem, before solving the general problem of characteriza¬ 
tion for a system (24) with irregular singular points. 

§ 13. The Problem of the Irregular Singular Point. 

In my paper referred to 15 I characterized the nature of the matrix 
solution of a canonical linear differential system (34), at least in the 
case that the roots of a certain characteristic equation were distinct; 
the case of equal roots introduces complications of an algebraical 
nature, and is put to one side in the present paper. 

The results which I obtained may be recapitulated as follows: 
If the singular point is taken at x = co, there exists a formal matrix- 
solution of (34) 

S(x) = (e p j {x) x*i8ij (*)), 

x p + l x p 

(35) Pi (x) = a> + A —+••• + (j = 1,, n), 

Sij (z) = Sij + i (i, j = 1,. . n). 


is I. §§ 6, 7. 
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where | sq | ^ 0. The quantities at,.. ..a, are the roots of the char¬ 
acteristic equation alluded to, and the series s (i (x) are in general di¬ 
vergent. We shall assume for the time being that no three of the 
points a,,. . .,a n lie on the same straight line in the complex plane. 
Let now r,, . ..,r v denote the S = n (n — 1) (p + 1) arguments 
in order of increasing angular magnitude such that for some j and 

k (j t* k) 

(30) I (a> — a*) x p * 1 ) = 0, arg x = r*. 

Here “ 91 ” denotes “ the real part of”. Let us write r N , x = ri + 2 t r, 
and let j n and k m denote the value of j and k corresponding to m, so 
ordered that the real part (36) changes from positive to negative 
as arg x increases through arg x = r m . There exist then A matrix 
solutions Z x (x),. . Z.\(x) such that for i = 1, - • A 

(37) Z m (.r) ~ S (x), r m ^ arg x < r**,, 1 ® 

and such that along argx = T m +\, Z m .\(x) and Z m (x) differ only in 
their j m th column, the j m th column of Z m ,\ (x) being obtained from 
that of Z n ( x) by the addition of the A-.th column of Z m (x), affected 
with a suitable constant multiplier c m , to the j.th column. As a 
matter of definition we take 

(38) Z,v., (x) = Z, (x) /' = «,). 

The proof of the existence of Z x (x),. . Z s (x) having these proper¬ 
ties can be directly based on the existence of a solution Z (x) asymp¬ 
totically represented by S (.r) along every particular ray. 17 

The properties so far stated are characteristic of the behavior of 
the matrix solution not alone of a canonical system but of any system 
with singular point of rank p at x - <*> in the neighborhood 
of the singular point, 18 and are invariant under a transformation 
Y(x) = A (.r) )'(•*•) where A ( x) is a matrix of elements analytic at the 
singular point in question and of determinant not zero there. 

When Zi(x),. .., Z s +iW are in addition the solution of a canonical 
system, the matrices Z m {x) are analytic in the finite plane for .r^O 

16 The relation "z (x)~s (x). arg x = <r ” means in the present paper that 
z (x) is asymptotically represented by six) in some sector (however small ) Inal 
includes arg x = <r as an interior ray. This slight modification of the conven¬ 
tional meaning of the symbol “ ~ ” and its natural extension to matrices is 
convenient for the present paper. 

17 I, §6. 

18 I. §6. 
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and of determinant not zero. At x = 0 these matrices are properly 
equivalent to a Cauchy matrix. These further facts come at once 

from the form of (34). % 

Conversely let us prove that if Zi (x), ... Z N (x) exist possessing 
such properties, they are all matrix solutions of one and the same 

canonical system (34 ). 19 

In the first place it is clear that the matrix 

P(x) = x Z m ' (x) (m = 1.A') 

is defined in each sector r m g arg x < r mt i (i = 1,...,A) and hence 
is defined in the entire plane. Now Z m + 1 (x) is obtained from Z m (x) 
by multiplication on the right by the matrix 

I + C m , 

where C m is a matrix of zero elements except for the single element c m 
in the k m th row and j„th column. It appears therefore that P (x) is 
single-valued and analytic along each ray arg x = r m \ for we have 
along this ray 

dZm ? —Z m ,r' (x) = dZ } {x -- 1/ + C„] [/ + CJf'Zw' (x) 


Moreover, on account of the equivalence of Zi(x),.... t Z m (. r) to a Cauchy 
matrix at x = 0, the elements of xP (x) are analytic at x = 0. 

Secondly, in the neighborhood of x = oo, the matrix P(x) is asymp¬ 
totic to 

(39) x S- 1 (x), r m g arg x 

where the meaning of the notation is manifest. It is to be recalled 
that the relation (37) holds (by definition) in a small sector including 
arg x = r m as an interior ray. This enables us to write 


dZ m (x) dS(x) 
dx ~ dx 


r m <! argx < r w+ i. 20 


19 I stated this fact without proof earlier; see I, p. 468. 

20 Ford, loc. cit. 
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Thus P(x) is represented asymptotically in the complete vicinity of 
x = co l>y (39). But we have formally 

,40, + 

If (35) and (40) be used in evaluating the expression (39), it is seen 
that each element of P(x) is given asymptotically bv a power series 
in descending integral powers of x, with leading term in ■t 7 ”' or lower 
power of x. 11 follows that the elements of PU) are analytic or have a 
pole of at most the (/>+ l)th order at x * oo. Hence the elements of 
P(x) are polynomials of degree at most p + 1. 

The central problem of the irregular singular point is, for a given 
choice of pM t ..., //,(*), r,,..., r„ and of r u ..., c N above described, 
to construct a matrix l’(.r) with the above specified properties. 21 


§ 14. Solution of tlic Problem of § 13. 

In order to solve the problem just stated we make an application of 
the preliminary theorem of Part I. taking r = .\ 2 and for the curves 
Ci,. . ., Cijv wof the .V 2 straight lines formed by the .V rays arg x = r m 
(m = I, .... .V) but by the .V rays arg .r == r'„ (nt = 1, .. ., A ), which 
are obtained from them by a slight rotation « in a clockwise direction 
in the complex .r-plane. It is evident that if < be taken small 
enough we shall have 

Zm (x) ~ 6' (.r), t* ^ arg.c ^ r'„*i, 

for m — 1,..-V. Furthermore we shall have for any m 

9* (aj„ — at m ) * p * 1 > 0. arg x = r'*, 

for in = 1,..., *V. This fact is essential to the solution. 

The matrices Ai(x) t . ..,'!)*(*) which are to be used in the applica¬ 
tion of the theorem are defined as follows. W rite 

T(x) = (xWx'jSij), 

and then put 

( 41 ) A m (x) = T(x)\I+ C m \T~'(x) (m = 1 ,. . ., N), 


21 Cf. I. § 7. 
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where x in A m (x) is taken along arg x = T n , and the determinations 
of T(x) chosen are obtained from one jinother by allowing arg x to 
increase from r I to t n . The matrix A m (x) thus defined is analytic 
along its line save at x = 0. Also .1* ( x ) — / is a matrix whose only 
non-zero element is 

Now we have 

x p*\ x p 

(.r) - Pirn (*) = (»*- - «») ^TT + 0*- - 0 ” y p 

+ ... 4- (X*m — 

which quantity by definition of j m , h n has a negative real part for x 
along the ray arg .r = t n , at least when | x | is sufficiently large. In 
consequence of the form of this non-zero element it is certain that 
A m (x) is unlimitedly differentiable along the ray at x = <*> and be¬ 
haves there like the matrix /. As this is true along each of the rays, 
the matrices A m (.r) clearly satisfy the conditions of the theorem in the 
vicinity of x = <». Choose -li(x) # ..xl$tf(x) along_the straight lines 
Cw • •» Ctf /2 as equal to the corresponding matrix yJ m (.v) along either 
of its component rays outside of some circle | x | = r, within which 
they are chosen so as to satisfy the conditions of the preliminary 
theorem. Since A\ (x),..., An (x) are of the nature above described 
a t , T = co, the matrices A t (x),.. .,A/// 2 (x) are unlimitedly differen¬ 
tiable, and satisfy the permutability condition (see (21)) of the pre¬ 
liminary theorem. It is therefore possible to make such a choice 
of Ai (x), ... As /2 (*)- 22 

It follows by the theorem that there exists a matrix <I>(x) of determi¬ 
nant nowhere zero save possibly at x = a = 0, analytic except along 
the rays arg x = r m ' and such that 

(42) ,- U “ + * (*) = * (*>] An (x»), 

where x m is a point of arg x = tJ for which | x m r. It also follows 
from the theorem that each element of 4>(x) is represented asymptoti¬ 
cally by a series in negative powers of x in each sector (t‘ mi X m» 1 ) i 
since A m (x)*» /, these series are the same in all the sectors and we may 
write 

( 43 ) •*•(*) ~ (*«(*)). 

where s t j(x) is a series of negative powers of x in which the determinant 
of the constant terms is not zero. 

22 Cf. § 9. 
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Now write for m = 1,..-V + 1 

Z m (.r) « *(*) T (*) (r' m g arg x ^ r'-i). 

From (41) and (42) there results along arg x = r ' m +1 
Z M (x) = Z m (x) [/ + C m ], | x | = r. 

The functions Z m (x) may accordingly be continued analytically across 
each ray arg x = T m and represent matrices analytic for | x | ^ r, of 
determinant not zero. The relation (43) leads to the conclusion that 

Zm (x) ~ S (x) (Tm ^ arg x ^ r'm+x). 

where S(x) is of the desired form (35), and thence to the conclusion 
that this asymptotic representation is valid for r n ^ arg x < r m + x . 
The relation between Zs *i (x) and Z x (x) is that stated in (38). 

In § 13 it was shown that, if matrices Z x (x ),..., Z N (x) had the above 
properties and the further properties that they were analytic in the 
finite plane of determinant not zero for x ^ 0, and at x = 0 were 
properly equivalent to a Cauchy matrix, then these matrices were 
solutions of a canonical system (34) with irregular singular point 
at x = co. The same arguments can be used to establish that the 
Z x (x),.. .,Zs ( x) before us are solutions of a differential system (24) 
having coefficients rational in character at x = co, with poles of order 
not more than p. But we have proved in § 12 that the matrix solu¬ 
tion of such an equation is properly equivalent to that of a canonical 
linear differential system at x = oo. Consequently a transformation 
~Z(x) - A (x)Z (x), where A(x) is a matrix of functions analytic in 
character at x = oo of determinant not zero there, makes Z(x) the 
solution of such a canonical system. In particular the matrices 

Z x (x) = A (x) Z x (x), .. Zn (x) = A (x) Zs (x) 

form the solution of our problem, a fact which is apparent if we 

note that S(x) = -4(x) S(x) has the same form as S(x). 

It has therefore been completely established under the stated 
restrictions that the characteristic constants which occur in the 
characterization of the matrix solutions of a canonical linear differen¬ 
tial system can be chosen at pleasure. 

The restriction that no three of the quantities ai,...,a n shall lie 
on a straight line is not essential, for if it is not satisfied and if no two 
of the polynomials p,(x), ..., p„(x) are identical it will be possible to 
replace the rays r x , ..., t n which have coalesced by an equal number 
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of curved rays so chosen that the real part of only one of the differences 
v (.r) — Pi to changes sign along the ray, and thence to apply the 
meliminarv theorem in much the same fashion as before. If two or 
more polynomials p, to,..., p» to are identical it is not necessary to 
modify the nature of the rays. 

Furthermore it would be possible to construct an analogous existence 
proof when S(x) has for its elements so-called normal scries. 

In other words, the results here obtained are of an entirely general 


nature. 

§ 15. The Generalized Ricmann Problem. 

The problem which I proposed in my paper on singular points (I) 
was the following: “ To construct a system of linear differential equations 
of the first order with prescribed singular points 


of respective ranks 


Xu Xu.. X m , Xm.l = 00 


q !»•••» Qm+U 


and with a given monodromic group, the characteristic constants being 
assigned for each singular point.” This problem is virtually solved by 

what precedes. . _ . ... . 

It is of course understood that certain obvious conditions ol com¬ 
patibility are satisfied, the first being the one already noted, 

Tm+lTn... T\ = I. 


However a second necessary condition must also be imposed. Take 
any assigned point * = The solution Zito appertaining to this 
point (see § 14) is transformed successively into 

Zi to (i + Ci)- 1 * z > <*> V + c*)- 1 or + Ci)- 1 , 

...>Zs+x(x)(I+C N )-'... (7+Ci)-S 


as x passes over the rays n, ... t n respectively. Hence after a com¬ 
plete circuit of n„ Z { to alters to 

z x to r (/ + c N )- 1 ...(/ + to, 

where the matrix 7\ of transformation is explicitly determined in 
terms of the characteristic constants. But in order for this^ set of 
characteristic constants to be possible, some solution Z (.r) = z i to C 
must undergo precisely the transformation by T it i. e., 


Ti = CTiC 1 (t = 1,. . m). 
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This second condition is satisfied if the elementary divisors associated 
with T t and T { are the same. 

With the understanding that the two conditions of compatibility are 
satisfied, we can assert that there exist m + 1 canonical systems with 
the prescribed characteristic constants at Xi,. ■ -, .»•*,. i ami with solu¬ 
tions V x (x),..., V m +\ (x) undergoing a transformation to V\ (.r)7\,.. ., 

V m + \T m +i as x makes a positive circuit of X|.r n .i respectively. 

Hence there will exist a matrix )’ (x) of elements analytic and of de¬ 
terminant not zero save at X|,..., x»,i, properly equivalent to I (.r) at 
x = a, (i = 1,. .. ,m), and properly or improperly equivalent to V„ tl (.r) 
at j n »i = oo. This follows upon application of the results of § 10. 
The matrix Y (x) thus obtained is a matrix solution of a differential 
system of the form required and has the monodromic group and 
characteristic constants required, if the equivalence at x ntX = co be 
proper. This is an immediate consequence of the results of §§13, 14. 

If this is not the case it is easy to show that if one merely increases 
the characteristic constants r Xt .. .,r H for the singular point x**., = co 
by suitable integers this equivalence at infinity becomes proper. For 
consider the matrix S(x) corresponding to the matrix solution of the 
equivalent canonical system at «. The corresponding matrix 
S(.r) for y(x) is of the form -4(x) S(x) where the elements of /l(.r) are 
rational in character at x = co. Hence if a proper choice of C be 
made, the matrix 5’ (x)C may be written (compare (33), § 1.1). 

e p ' (x 'x f i ^ou + ^ + ... 

e*M*>x*i ^a M -f ~ .fW" J 

where r lt ..., r„ differ from n, ..., r„ by integers. By a process of re¬ 
duction exactly like that given in § 11 this matrix may be replaced by 
a scries of matrices of the same form in which the exponents r x , .. ., r n 
are increased so long as | a, ; | ^ 0. We conceive of }' (.r) as affected 
by the same series of operations, which have no effect on the proper¬ 
ties above specified. Furthermore the transformed F(.r) and S(x) 
stand always in the same relation to each other after as before trans¬ 
formation. If the process comes to an end so that \a ti \ j* 0, the 
linear differential system with matrix solution Y(x) will have a singu¬ 
lar point of rank q m +1 at x = oo (compare § 13, pp. 547-548) and the 


). (am +*" + •••) 
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associated properly equivalent canonical system has characteristic 
constants only modified as stated. 

V 0 w we have 

I Y(x) ~|S(x)j 


in the complete vicinity of x = oo, which is made up of the sectors 
(n,T 2 ), . • (T//,rtf + i). It follows that the series for | S (x) l converges 

and hence must be of the form 


«*»«+. .♦*„ (z, -x*(« + ^ + 



a 0, 


so that always 

n + . . . + u i: p- 

But the above reductions increase r x + • • • 4- r„ and do not affect 
I S(x) | or the value of p, and so must terminate. 

We can state then that either a solution of the stated problem, or of 

a modified problem in which the constants r,.r„ of one of the singular 

points arc altered to r» + r n + /„ respectively , where are 

integers, will exist. 

The matrix Y(x) thus obtained is not always unique. I he most 
general determination is however of the form P(x) V(x) where P(x) 
is a matrix of polynomials of constant determinant which fulfills other 
conditions. Thus the notion of “primitive systems” admits of 
extension to the case of irregular singular points. 23 


Part III; The Linear Difference Equation Problem. 

§ 1C. Formulation of the Problem. 

Let 

(44) y (x + 1 ) = Q (x) Y (x) 

be a linear difference system in which the elements of <?(x) are poly¬ 
nomials of degree p in x. 24 In my earlier paper on linear difference 
equations I demonstrated that, at least if the above equation admits 
a formal matrix solution 

(45) S(x) = [x* 4 * (p>e _M ) z x r; Si> (x)), 

in which s, ; (x) is a power series proceeding according to negative 
powers of x with the determinant of the leading coefficients not zero, 

23 Cf. Plemelj, loc. cit. pp. 240-245. Like theorems may be proved in a 

similar manner here. . . , . 

24 Essentially the most general linear difference system with rational coei- 
ficients may be reduced to this form; see II, § 5. 
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there exist two matrix solutions Y~ (x) and Y* (x), with elements 
analytic in the finite plane save for poles, such that Y~ (x) ~ S (.r) in 
any left half plane and Y+ (.r) ~ S (x) in any right half plane. 25 The 
existence of such a solution was proved by Norlund and Galbrun by 
methods based on the Laplace transformation somewhat earlier. 26 
These matrices Y~ (x) and )’♦ (.r) are connected by a relation 

(40) Y-(x) = )'* (x)P (x) 

where P (x) is evidently a matrix of periodic functions of period 1. 

From the form of (44) it appears that Y~(x) is a matrix of entire 
functions, while Y+ (.r) is analytic save for poles. 

In my paper I determined explicitly the nature of the elements 
Pij (:r) of P (x) to be the following: 

(47) 

p«(x) = 1 + «,<»«*' .. .+ «*»'«"» V-IX+ e! .r ( V-lg2.» V-.x 

(i = 1, . .n) 

p» (*) = V->x [c( .,., + . . . + «/•<-» vr, 'l 

(i j; i, j " 1, .. •» n), 

Here stands for the least integer as great as 

(48) [log Pi — log p(]) • 

An analogous determination in certain cases at about the same time 
was made by Norlund (loc. cit.). 

It is not difficult to show that, if Y~ (x) and Y*(x) have the properties 
above outlined, then conversely they are solutions of a linear difference 
system (44) in which the elements of Q (x) are rational if not polyno¬ 
mial. 27 For this reason the constants p,-, r jt cj k) may be called the 
characteristic constants of Y~(x) and F*(x). 

This characterization suggested to me the following problem: To 
construct a linear difference system (44) with assigned characteristic 
constants in which the elements of Q(. r) are polynomials in x of 
degree not greater than p. 

25 These matrices Y~{x) and K*(x) correspond tof7(x) and //(x) of II. In 

certain cases it may be necessary to consider half planes not bounded by a 
vertical line. I refer to this possibility later. . . 

26 Norlund, Dissertation, Copenhagen (1911); Galbrun. Dissertation, Pans 
(1910). 

27 See II, § 7. 


292 


BIRKHOFF.— THE GENERALIZED RIEMANN PROBLEM. 


555 


§ 17. Solution of the Problem of § 1G. 

In order to treat the problem of § 16 we apply the preliminary 
theorem. We shall take r = 1, and take C x to be the axis of imagi- 
naries in the complex plane unless | P(x) | = 0 at a point of that axis. 
The matrix A x (x) is taken equal to 

T (x) P (x) T- 1 (x), T (x) = (x~ (pi^Y *■%). 

except near to x = 0, where it is chosen in any way so as to satisfy the 
restrictions of the preliminary theorem there (compare § 9). Since 
the elements of T(x) are in general multiple-valued functions of x, 
it is necessary to specify which branch of T(x) to select. We shall 
choose a continuous branch of T(x) in the right half plane, and a 
continuous branch in the left half plane in such a way that these 
branches coincide along the upper half of the axis of imaginaries. 
The first factor T(x) in the expression for A i(x), will be identified 
with the first of these branches, and the last factor T~ l (x) will be the 
inverse of the second of these branches. 

It is therefore clear that, along the upper half of the axis of imagina- 
ries, the element in the t'th row and jth column of A x (x) is, for i j, 

e 2 .Ui Vri, pi x p .-x a .r r r> [ C|> ( 0 > + . . . + V " lx ] 

while the diagonal elements are the same as for P(x). But by defini¬ 
tion of \j, 

(49) 1 > 9f (*<> - (>°g Pi ~ lo K P‘)J = °- 

Let us exclude at present the case of the equality sign; the element of 
A x (x) in the ith row and jth column (t ^ j) will therefore vanish to 
infinite order together with its derivatives as x goes to infinity along 
the upper half of the axis of imaginaries. The diagonal elements 
diminished by 1 have the same properties. 

Hence we have A x (x) ~ / along the upper half of the axis of imagi¬ 
naries, while all the derivative matrices of A x (x) tend to matrices of 
zero elements as x becomes infinite. If ^li(x) has this character along 
the lower half of the axis also, it is clear that this matrix satisfies all 
the restrictions imposed in the theorem. 

Let us demonstrate that such is actually the case. The determina¬ 
tion of T(x) on the left-hand side of the lower half of the axis of imagi- 
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naries is obtained from that on the right-hand side by a complete 
positive circuit of x = 0, during which TV), changes from 

(x-* (p,e->‘) z x r j6ij) to (c-’* v: ‘V' v - ,, ix»*' (p,<r»y x r >5 i j). 

The ith diagonal element of ylj(x) may now be written 

ca m c~- w v:,r it vr,x + . • • + b 

while the element in the ith row and jt\i column of J(x) (i j) may 
be written 

c cr<x ir i> V-i Xpf P) -* x Ti- r i V " |z + • • • + c./*" 1 '). 

Bearing (49) in mind we readily perceive that Ji(x) does have the 
indicated properties along the lower half of the axis of imaginaries. 

According to the preliminary theorem we can then determine a 
matrix <I>(x) such that 

(50) , , iT 1 -4>(x)-[,»T,.<*>W)-liW. 

where .ri is a point of the axis of imaginaries and the approach is from 
the left-hand and right-hand side of that axis respectively. If we take 
x - a - 0, the determinant of <P(x) is not zero in the finite plane except 
at x — 0 possible, and the elements of this matrix are analytic at any 
point not on the‘axis. Along the axis as defined from either side these 
elements have continuous derivatives of all order, and will be analytic 
at more than certain distance d from the origin. In the vicinity of 
x = oo , <t>(.« ) is represented asymptotically by a matrix of series in \ /x 
with determinant of leading coefficients not zero. This matrix is the 
same on either side of the axis, since A (x) «• /• 

Let us denote <P(x) by U+(x) for .r in the right half plane and by 
U- (x) for a- in the left half plane, and write 

Y+ (x) = U* (x) T (x), Y- (x) = U' (x) T (x). 

From equation (50) we see then that 

(51) F-(x) - YH*)PW 

for | x | > r along the axis of imaginaries. From the asymptotic 
form of (/♦(*) and C (x) at x = co we obtain 

r-(x) ~ S(x), Y+ (x) ~ S(x), 

in the left and right half plane, where S(x) is of the same form as S(x) 
above. The relation (51) shows that )'“(x) is composed of elements 
analytic in the right half plane. 
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Let us now apply the preliminary theorem a second time, taking 
r = 1, and for C, a circle with center at the origin and radius so large 
as to include within it all those points of the axis of imaginanes at 
which an element of -i,(x) as chosen above is not analytic, and also 

so as not to pass through a zero of | K“(x) I- M , 

In this second application of the theorem we choose AiW to oe 
[Y-(*))-', and in this way satisfy the restrictions of the theorem. 
Furthermore let us take a = oo. 

Along the circle C. we have for the solution <*>(x) 

(52) * ( 2 )] [F- (xi))-', 

where the approach to the point x, of C is from without and within C 
respectively. Now write 

y- (*) - * (*) Y- (*), Y* (*) = * (*) Y* (x) 

for x outside of C„ It follows that Y~(r) is composed of elements 
analytic in this region; also along C 1 , )’“(x) coincides with the inner 
determination of <t>(x) by (52), so that the elements of Y~(x) are also 
analytic within and on C,. Hence )'-(x) is a matrix of entire functions. 
Similar considerations show that the elements of Y+(x) are analytic 

in the right half plane like the elements of V+(x). 

At x = 00 , Y-(x) and Y*(x) are asymptotically represented by a 
matrix S(x) in which however r„ ..., r„ are not necessarily the same 
as in S(x), and in which the determinant of the leading coefficients 
may be zero. This results from the fact that the elements of 4>(.t) 
are* rational in character at x = 00 and in consequence can be ex¬ 
panded in convergent series in descendingintegral powers of x. In¬ 
asmuch as we have \S (*) | - | * (*) H S (*) |, it is also true that 
| S (.r) | cannot reduce formally to zero. 

Finally from (51) and (52) we infer that 

(53) r-to- Y*b)PW. 

A first conclusion to be derived is that F”(j) S(x) in any left half 
plane, and that also V (x) ~ S (x) in any right half plane. In fact 
we have already determined the asymptotic form of P (x), and this 
known form combined with the known asymptotic form of }'*(.r) in 
the right half plane gives us the form of Y~(x) in the part of the plane 
to the left of any line parallel to the axis of imaginaries; a similar 
remark applies to the asymptotic form of )’ + (x) in any right hall 
plane. 
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One further remark comes in appropriately at this point. The 
definition of Y (x) and F'(x) ensured that | Y~ (*) | and | V* (x) | do 
not vanish to the left or right of the axis of imaginaries respect.vely, 
save possible for x - «o. Henee | Y~(x) | and | Y*M I do not vanish 

in these left or right half planes respectively. . 

We may now enter upon a series of modifications of \ (x) which Mill 
preserve the above stated properties and secure in addition that the 
determinant of the leading coefficients of S(x) is not zero. To this 
end we write the matrix S(x) in the form 




a„ + 6 " +.. •).*■* (Pne-y x r - (a>, + + • • •). 


r »*X 


(nrVf'fa + T + ’ ‘.*'■' +6 f + ■' ')i 

and carry out reductions parallel to those given in § 11. The same 

reductions are supposed to be simultaneously efrected upon l UO and 

y*(x) (compare § 15). This set of reductions will ter nmate smee 
| S(x) | does not vanish identically, and, when it does, 1 (.r) and 1 (.r) 
will have the desired additional property. 

Consider now the matrix 

Q (!) = y- (x + i)[y- (x )]- 1 = y*(x+ i)i y* (*)]-. 

These two forms for (?(*) are equal in virtue of (53) and yield us at 
once the asymptotic form of the elements of «(*)»» COmp ' C * C { 
vicinity of x = ■» as descending power series in * with leading term of 
the “th degree at most.** Hence the elements of Q (*) are rational 
in character at x - «*>. with a pole of at most order M there . 

In the finite plane to the left or along the a.x 1S of imaginancs the 
first expression for Q(x) shows that the elements of Q(x) are anal>tic 
Shout exception. It will be recalled that Y~(x) s a matrix of entire 
function of determinant not zero in_ thei Wt half plane 'nc'usiveofthe 
axis of imaginaries. On the other hand the elements of Q(x) areana- 
lytic to the right of the axis of imaginaries, as the second form shows. 

Accordingly Q (x) is a matrix of polynomials of degree at most M 
and Y-U), Y*(x) are solutions of the rational difference system (44). 

Finally it may be observed that in case | P(x) j - 0 along the axis 
of imaginaries, a y parallel lin e may be used to take the same r61e; or 

28 Cf. I, 17. 


296 






BIRKHOFF.— THE GENERALIZED RIEMANN PROBLEM. 


559 


indeed any simple analytic curve without a horizontal tangent and 
with vertical asymptote, provided that | P(x) | 0 along the curve. 

If the equality sign obtains in (49) it will be necessary to employ 
a curve with asymptote not quite in the vertical and to employ half 
planes not bounded by a vertical line. 

It is also possible to replace S(x ) by certain anormal forms, 2 and 
thus extend the above results to the most general case. 

Our conclusion may be summed up as follows: There exists a lincur 
difference system (44) with matrix solutions Y~(x), Y+(x) which cither 
possesses prescribed characteristic constants pj,r it cj k \ or else constants 
p jt Tj+lj, c,/ fc ' where l u are integers. For an arbitrary curie 

which meets each line parallel to the real axis only once, having a ver¬ 
tical asymptote, and which docs not pass through a point | P(x) | = 0, 
there exist such matrices Y~(x), Y+(x) with the further property that 
| Y~(x) 1 5 ^ 0 to the left of the curve while the elements of l’+(.r) arc ana¬ 
lytic and | Y+ (*) | 5 * 0 to the right of the curve. 

It is worthy of note that this last property determines the location 
of the poles of the elements of F*(x) completely: namely, they occur 
to the left of the curve and at the points for which | P(x) | = 0. This 
appears from the formula 

Y+(x) = Y-(x) P-> (*), 

which also permits us to affirm that the precise maximum order of pole 
of any element of Y+(x) is the order of the zero of | P(.r) |. 


Part IV: The Linear ^-Difference Equation Problem. 

§ 18. On Linear q-Diffcrencc Equations. 

A linear ^-difference system may be written 

(54) Y(qx) = Q(x) Y (x) |f | > 1. 

where Q(x) is a matrix of polynomials of degree p or less, in analogy 
with the normal form (46) of linear difference systems. The apparently 
more general case in which the elements of Q(x) are rational in .r may 
be reduced to this form readily. Let the least common denominator 
of the elements of Q(x), written as quotients of relatively prime poly¬ 
nomials, be 

(x — ai)...(x — a{) 


29 Analogous to the anormal series for linear differential equations. These 
forms have recently been obtained by Mr. P. M. Batchelder. 
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and let Si (.r) be a solution of the -/-difference equation of the type (54) 
(55) 9 <7 J ) = ~ m) 9 <J ' ) 

for m = If one takes for new variable 

rw = W- jiWi'W. 

a new matrix equation (54) in ?M is obtained with (?(*) polynomial 
in .r. 

Let us write 

t = loR - 

q' 

In terms of this new variable a solution of (55) for m = 0 is 

9 Kr-«. 

For mi ^ 0, the transformation 

lORX u>*i 

.r = mx, y (x) = «’ V- ‘ *°« • »' fog * V W 
takes (55) to the normal form 
(56) 2/(9-0 “ (1 “ x)y(x). 

Two solutions of this equation are 

-(i-!X l “?)•••’ 

y x (*)- ,»«’-«*-' v -“ • -ji • ■ •• 

1 x 1 qx 


(57) 


as one may verify by direct subst.tution. The function !/«(*) P>“> 3 
the same role for the linear -/difference equations as the gamma func¬ 
tion does in the theory of linear difference equations. I haxe men¬ 
tioned these functions in order to supply an example later. 

The fundamental existence theorems for linear ,-difference equa¬ 
tions are essentially a consequence of the work of C^yy jnd 
The first complete treatment has been given by Carmichael, and the 


32 P ,r J lur Si MaT 34. 147-168 (1912). 


31 Paris thesis, 1897. 
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result may be expressed as follows: There exist in general two matrix 
solutions 

( I'lW = (x’l 0,7M) 

(58) r.W^ H (r4,W), 


where each function a,j(x) is analytic at x = 0 , and each function b,j (x) 
is analytic at .r = co ; and where furthermore the determinants of the 
leading coefficients of a,j (x) and b fi (x) at .r = 0 and x = oo respec¬ 
tively are not zero. It is only the case when such series exist that 
will be here considered. 

It follows at once from (54) that l'o(-r) is a matrix of functions ana¬ 
lytic for x ^ 0, oo, and that (.r) is a matrix of functions analytic in 
the finite plane except for poles when x ^ 0. Further, if we write 


Yo(x) = y B (x)PW, 


then P(. r) is a matrix of functions analytic for x ^ 0, oo, and possessing 
the property that P(qx) = PM* 3 These properties are in close 
analogy with the properties for a linear difference system, to which 
indeed (54) reduces formally by the substitution .r = (f. 

I propose now to determine completely the nature of P(. r), as I 
have done for the analogous functions P(x) associated with the linear 
difference system; it is the doubly periodic functions which enter here 
instead of the simply periodic functions. By means of this determina¬ 
tion it will be possible for us to state the problem which, for this field, 
is analogous to the problems above treated for linear differential and 
linear difference equations. 


§ 19. On the Matrix P(. r). 

Let us make the transformation x = (f and write 

PM = P(t). 

The function P(t) is a single-valued function of t analytic save for 
poles; for, this transformation takes the Riemann surface of infinitely 
many leaves, with logarithmic branch points at x = 0 and x = oo in 
a one-to-one and conformal manner into the /-plane. 

Let us conceive of the /-plane as divided into parallelograms which 


33 Cf. Carmichael, loc. cit., p. 159. 
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belong to the periods » = 1. «' = 2xV_ 1, log?, and let A BCD 
be a parallelogram with vertices 

, . 2ttV— 1 , 2xV 1 

a*, .vo + 1, *o + 1 + - fog q~’ - To + log <;« ' 

respectively^ At homologous points of BC and AD, P(t) has the same 

value, since P(t + 1) = P( 0- . . , , R 

To obtain the relation between P(t) at homologous points of AB 

and DC we consider first the matrix 

(59) P(*) = Y «~ l (*) Yo(x). 

It is apparent from the form of the elements of 1 o(-v) near .r * 0 as 
given by (58) that, if a positive circuit of * - 0 be made, 1 0 W 
will change to Y 0 (x)K, where 

K = (c 2 *'i< = s «o); 

likewise, upon a similar circuit, I'«>(*) will change to 


(— l)**e 2,rM V '" 1 FooW 

where 

L = v=i ««)- 

If these modified matrices be substituted in (59) we obtain the form 
which P(x) assumes after x has made a positive circuit of the origin. 

This is 

(_ i). e —*"• V -‘ '(NiL-'PW AT. 

If therefore p„ (*) denotes the element in the ith row and jth 
column of P(x), such a circuit modifies p,,{x) to 


(_ i). e -2 " ' /_1 ‘e^*«e 2,w+ ^ V-1 Po (*)- 

But this circuit in the x-plane corresponds to a passage from a point 
of in the (-plane to the homologous point of DC; in this way, let¬ 
ting va W stand for the element of P(0 in the ith row andjth column, 
we find 


(60) yij (l + 


2 t rV— 1 

log? 


-) = (- 1 Ye 


V-i p iy (() 
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We have also seen that 

(61) PH (* + 1) = Pij (0- 

Now let us attempt to satisfy (60) and (61) by writing 

(62) pij (0 = ce°‘ ,+ *<r (t — ai) ... a (t — a u ), 

where <r(t) is the Weierstrass sigma function belonging to the periods 


w = 1, 

which satisfies the relations 


2 w: 


CO = —r- 


log? 


(63) 


a(t + 


log 9 


V=I> 


= _ /(“W) 


«(<). 


*(<+ 1) = _ e n«+» ff (0, l -'og - --V = 2»V_1. 

The above choice of w and w' meets the requirement SR (w'/w) > 0, 
since | q | > 1- 

A direct substitution into (61) determines 


(64) 


a = — b = V H a,- + n* V— 1 + 2 kir V — 1, 

- *-i 

in which h denotes any integer. If these values of a and b are taken 
and a direct substitution is made in (60), there is obtained the further 
condition _ _ 

* , Mir V_ 1 , , , 2 kir V— 1 

(65) L a< H- i ~ _ = o{ + pi + l — 


•-1 


log q 


log q 


in which l denotes an integer. These conditions are necessary and 
sufficient that an expression of the form (62) shall satisfy both (60) 

and (61). If the value for Ea, deduced from (65) be used in (64) 

»■ i 

the expression for b simplifies to 


V (**• + Pj + 0 — V + kj- 


But if one adds or subtracts a period to a, the precise effect is to alter 
k or l by an integer. It is therefore always permissible to take k = 1 = 0 
and to write 

(66) p«(0 = Cij e~^ r+ ^‘ +,S> ~ a (t — «,«>) . ..a (t- a„<V’), 
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provided that 


21 ax (,,;J = <?i + Pi — 

x-i 


H* V— 1 
log q 


This is of course under the assumption that p„(f) may be represented 
in the form (02). But this fact may be proved at once. For let 
be anv function which satisfies (60) and (61), and 4,(1) the part.oular 
one above obtained. The function *(<)/* (0 « doubly periodic, 
analytic save for poles, and can therefore be expressed as a quotient 
of products of sigma functions 

r o (l — 7i) ... <r(t —7k) - _ 

<r (t — 0i) . . . * (t — fa) ’ 

But this quotient when multiplied by <t> (0. which is expressed as a 
product of sigma functions, must yield ^ (/), an entire function. This 
necessitates that for each zero of a (f — 7>) m the numerator 
(i = 1, .. .,/i) there must be a congruent zero x = ft of a (x — ft) m 
the denominator. Such pairs of corresponding factors may be com¬ 
bined leaving only an exponential factor Thus (t) appears in 

the same form as <t> (t). . 

Our result is therefore that the element p i; (x) of / (.v) is of the form 
(GO) (/ = log x t log q), where p u .. .,p„. *u • • •» *n * re t,ie constants 
that appear in the series representations (58), and where the condi¬ 
tions (07) are fulfilled. . . . . , 

It is interesting to apply these results to the equation (oG) m "Inch 
n - 1, p - 1, pi - 0,<ri - ir V—1 log q, and 1' 0 W and )*» (*) reduce 
respectively to Uo W and (x) defined in (57). In this case we have 
therefore 

(68) y<>(*) = Vx WpW 

where 

The constant c may be determined by writing (68) in the form 
i/o (*) = [(* -1) y* Ml 

and allowing * to approach 1. Since <r(0) = 0,tr'(0) = 1, this gives 
(see (57)) 
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The relation (GS) with the explicit values of yo CO, y x M p( x ) 
substituted in is essentially one of the fundamental product formulas 
for the sigma function. 

§ 20. The q-Diffcrencc Equation Problem. 

It is now easy to show that conversely if }' 0 (.r), Y x (.r) are matrices 
of functions of the form (58) in the vicinity of x = 0 and x = co 
respectively, analytic for x 5 ^ 0, co, save for poles, and if the matrix 
P(x), defined by the relation l'oto = Y„(x)P(x), is composed of 
elements pa(x) which are left unchanged when x is replaced by q. r, 
then Yo (.r) and Y x (x) are matrix solutions of a linear ^-difference 
system (54) with rational coefficients. In fact, if we write 

Q(x) = Yo (qx) Yo-' (x) = F x (qx) Y x ~' (.r), 

it is seen at once that for x 7* 0, oo, the only singularities of Q(x) 
are poles, while the first and second of these forms for Q(x) ensure that 
Q(x) is composed of elements analytic at x = 0 and with a pole of at 
most order y at x = co if not analytic there. This suffices to establish 
the fact that the elements of Q(x) are rational. 34 In order to conclude 
further that the elements of Q(x) are polynomials of degree y it is 
sufficient to know that the plane may be divided into two parts by a 
loop about x = 0 meeting each equiangular spiral or radial line 

( 70 ) 0 = c + Jog r (r, 6, polar coordinates) 

v log ! q , 

only once and not passing through a point | P(x)\ = 0, such that the 
elements of }’ 0 (x) are analytic and |V 0 (x)| is not zero within or along 
the loop, while the elements of Y„ (x) are analytic and | Y„ (x) | is 
not zero outside the loop. Under these conditions the first expression 
for Q (x) makes it evident that its elements are analytic within or 
along the loop, and the second expression makes it clear that the same 
is true without the loop, since if x is a point without the loop so is qx. 

It is natural to term the 2 n constants pj, <jj and the n- {y- f 1) con¬ 
stants Cij, ai (,,;) , .. ., the charaeteristic constants. These constants 
are not all independent, since there are n- relations between the con¬ 
stants a t (,J) of the type (G7). Furthermore the constants r„- are not 
uniquely determined by the given ^-difference system. For, any Jth 
column of 1’ 0 (x) is only determined up to a constant factor /, and like- 

34 Compare II,- § 7. 
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wise any ith column of Y„ to is only determined up to a constant 
factor This fact appears from (5S). The efTect is to allow one 
to replace c„ in Pij (x) by fa/h* and thus vary at will 2 if - 1 of the 
characteristic constants c ti . There remain then essentially only 

2n + n 2 (jx + 1) - »* “ (2n — 1) or n*/z + 1 

characteristic constants. It is readily verified that_thesc constants 
are all invariant under a transformation )'(*) = CY(x), where C is 
an arbitrary matrix of constants. . 

But the equation (54) involves n*(ji + 1) arbitrary coefficients in 
Q(x) of which there are n* 0* + 1) - (»’ - 1) or + 1 invariants 
under the same linear transformation. Hence we have found as many 
invariants for the g-difference system as invariant characteristic 
numbers for the solutions. 

We are thus led to formulate the following problem: To construct 
a ^-difference system (54), with coefficients of degree not greater than 
P in x, having any assigned set of characteristic constants. 


§ 21. Solution of the Problem of § 20. 

Here also we shall employ the preliminary theorem, but the appli¬ 
cation of it is even simpler than in the earlier cases. 

The conclusion that we shall derive is the following: There exists 
a linear q-diffcrence system (54) with the matrix solutions Yq to, Y to 

either possessing prescribed characteristic constants p it Oi,Cij, oi . .,a„ *• 

or else constants p„ c, + />, c„-, «,«•».i, M , "here /,. X are integers. 

For an arbitrary loop about x = 0 which cuts each spiral (70) only orwe 
and does not pass through a point | P(x) | = 0, there exist matrices 
Y 0 to, T* (x) with the further property that | y 0 (x) | ^ 0 unthin or along 
the loop while the elements of Y m {x) are analytic and | Y„(x) I w not 
zero without the loop. 

Let C, be a specified loop of this description which may be taken to 
be analytic. 36 We may take the matrix A^x) of the preliminary 
theorem to be , x m , . 

T„ (x) P to T *- 1 to» 

where 


To (x) = m T a (z) 




since the elements of -4,(x) are single-valued and analytic along C, 

by (GO)._ 

36 It is only a question as to how the loop weaves among the zeros of |P(*)I- 
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According to the theorem there exists a matrix $>(*) such that 


(71) 


11 m 


.?♦*(*) = [,!£-*(*)] AM. 


where x, is an arbitrary point of C, and the + and - signs denote 
approach to *i from within and without C i respectively; the matrix 
4 >(x) possesses certain other properties: it is composed of elements 
analytic for x not on C u save at a point a which we shall take to be at 
infinity; furthermore its determinant does not vanish in the finite 

P, Let us denote by U 0 (x) the matrix 4>(x) within C u and its 
analytic extension across C,; similarly by U x (x) let us denote the 
matrix 4>(x) without C x and its analytic extension across C\. Consider 
then the matrices 

YfT (*) = U * to ^ <*>. (*) “ U » W T m (*)• 


From (71) we obtain at once 


(72) 


l'o-(x) 


y»(x)P(x), 


and prove without difficulty that Y 0 ~(x) and Y„+(x) as thus defined 
have the characteristics demanded save possibly that >’« (x) may not 
be precisely of the form (58) at x = oo, as it would be if 4>(x) were com¬ 
posed of elements analytic at x = oo and if also |*(x) | were different 
from zero at x = oo. Nevertheless one can always write Y « (x) in 
the form 


I 


ar 1 ' («n + “; + 
x- ?1 ^a„i + ^ + 



‘ • r " 5 " ( a,n + ^ + • * *) 
. . x- 5 " (a nn + ~ + * * *) 


where a,.differ from by integers. By a process of 

reduction precisely like that employed earlier (§11) one may further 
modify Y 0 (x) and simultaneously Y^ (x) so as to preserve all of the 
properties already noted and to finally obtain | a 0 | ^ 0. It is to be 
recalled that | Y 0 (x) | is not identically zero and can at most vanish 
to a finite order at x = oo ; for it is this fact that enables us to conclude 
that the process of reduction terminates. 

The argument of the preceding paragraph shows that Y 0 (xj and 
y w (*) will be matrix solutions of a system (54) with coefficients poly¬ 
nomials in x of degree n at most. 
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It is deserving of notice that the properties stated above determine 
the location of the poles of the elements of )' w (.r), namely at the zeros 
of | PM | within the loop, the maximum order of the pole of any 
element being precisely the order of the corresponding zero of | PM I- 
This is an immediate consequence of the relation. 

r.w = y 0 M p-' to- 

Harvard University, 

Cambridge, Mass. 
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ON A SIMPLE TYPE OF IRREGULAR SINGULAR POINT* 


BT 

GEORGE D. BIRKHOFF 


Introduction. 

A simple type of irregular singular point is furnished by the linear differential 
equation of the second order 

(1) g+p(*)|+9 ( *)»-° 

in which the coefficients p(x) and q (*) are analytic at * - “ • Th * 
point x - «> is then an irregular singular point of rank 1 unless xp (x) an 
3 * q( x) remain finite at infinity. If we write lim p ( * ) =» po, Jim q (*) - ?o, 

and if the roots b x , b 2 of the quadratic equation 

b* + po b + (Jo 53 0 

are distinct, the transformation of variables from x, y to £, y, 

y=e?'*x ei y, x=£(bt — bi), 

will take (1) into another equation of the same form, in which however 

(2) p(*)--i + 5+—» ?(*) = B + ? + 


provided that a proper determination of the constant c x be made. 

In the present paper I wish to consider the solutions of (1) in the vicin¬ 
ity of x = oo , but under the restriction that p (x) and q (x) have the form 
(2).t The method of attack is essentially the same as that which I have em¬ 
ployed earlier in the consideration of singular points-t In this special case the 
results may be given a more striking form. The last two theorems have no 
analoguesin my earlier paper and are susceptible of wide generalization. 


m 


• Presented to the Society, September 12, 1911. ... » t .u. 

t This type of equation has been considered in a paper by Horn, JoumaKur Maine 

a t i k , vol. 133 (1908), pp. 19-67. 

t These Transactions, vol. 10 (1909), pp. 436-470. 
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The treatment will be based on the following auxiliary theorem:* Let 
l X i(x), In (x), hi(z), ht(x) be functions single^talued and analytic for 
| x | ^ R (but not necessarily at x = « ) and such that ln(x)ln(x) — 
ln(x)hi(x) does not vanish for | x | ^ R. Then there exists a set of func¬ 
tions an(x),a u (x),an(z) t an(x) analytic at x = « and reducing to 1, 
0,0,1 respectively at x =<*>, and a set of entire functions e xx (x) , e x % (x) , 
eu (x), e u (x) for which e xl ( x) e n (x) - e X t(x)eu(x) is nowhere zero in 
the finite plane, such that 

hi ( x) = [ an (x) en (x) + (*) **i (*)) ***» 

ht (x) = (an (x) en (x) + <*it (*) «n (*) 1 x kt , 

® hi (x) = [(hi ( x) en ( x) + an (*) «ii (*) ] x ‘'> 

hi (x) - [an ( x) en (x) + an ( x) e« (x)) x kt , 

where k x and k x are integers. 


§ 1. The Canonical Form of Equation. 

If we set x (dy fdx ) = z, we may replace the single equation (1) by the 
linear differential system 


(4) 


dy __ 1 
dx ™ x 2 


£ = -Z7(x)!/+(-p(*)+^)z. 


Let us now make a linear transformation of the dependent variables 

(5) y = a„ (x) Y + a lt (x) Z, a =» a,, ( I )Y+a„(x)Z, 

where the functions a„ (*), a„ (*), a,, (x), a» (*) are as yet unspecihed 
functions for which A (i) = a„ (*) a,, (x) - a„ (i) a„ (x) is not identic¬ 
ally zero. The new linear differential system in Y, Z will be 


dY 


dZ 


P tl (.x)Y + P»(x)Z, 


(6) Pn(x)Y + P„(x)Z, £ 

where the coefficients of Y and Z have the values 

* This theorem is the special c»..2ol. theorem which I have proved, M a t h e- 
m&tische Annaleo, vol. 74 (1913), pp. 122-133. This proof, based on the theory 
of linear integral equations, was first presented by me in 1911 (see Bulletin of the 
American Mathematical Society, vol. 18, p. 64) although 1 was in possesion of 
it a year or two earlier; a proof by an entirely different method was given by me in IMS (see 
these Transactions, vol. 10 (1909),p.438). I have recently noticed that the theorem 
ia ft special case of a very important theorem due to Hilbert, G6ttmger Nacfa- 
rich ten (1905), pp. 307-330, and Plemeu, M on at she f te f Qr Mathematik und 
Ph vsik vol. 19 (1908), pp. 211-222. This relationship becomes obvious if one identifies 
the functions Ut (*) with the functions «<, (x) in Plemeu's paper and solves the equaUonsbke 
(3) for an (x); for the boundary curve ( Randkurvt ) may be taken the curve | x | - K. Further¬ 
more the method used in my second proof is essentially the same as that used by them. 
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(7) 


Pn = * 9a, ‘ + (“ p + 5 ) 0,1 " )] ’ 

Pn = l [a„ (ia„ - a!,) - a u (- *?°» + ( " P + \) 

Pn = 1 [ - a,, (^a,, - a|,) + a„ ( - xqa a + ( “ P + i) ’ 

Pn - 1 [ - 0,1 C * 0 ** ■ °'“) +0,1 ("* ?a “ + (" P+ *)'“ “’’)]■• 

The accent here denotes differentiation, and for brevity the letter x is sup¬ 
pressed in the functional notation. If the functions a u (*)»«»(*)» ( *) ’ 
o, 2 (x) are analytic at x = « and reduce to 1,0,0,1 respectively at * - *> » 
the function A (x) reduces to 1 at * = 00 , and a direct computation by 
means of (7) gives (1) 

PnW-*? + ** + 


22 

X* 

:«> 


( 8 ) 


+ ••• 


An <*)-! + 


1-Pi , P» 


+ ^ + 


Later we shall make use of these expansions in series. 

By the aid of the auxiliary theorem above stated we shall choose the trans- 
formation (5) so that the equations (6) reduce to a very simple form. 

Consider the solutions of (1) in the vicinity of * = » . According to the 
fundamental existence theorem for ordinary linear differential equations, 
every solution of (1) will be analytic for |z| £ A. if A is chosen so large that 
p (*) and q (*) are analytic for | x\ £ A; and the general solution will be 
ci yi + c s y t , where c,, c, are arbitrary constants and y u yi are any pair of 
linearly independent particular solutions. But y\ and y, are not in general 
single-valued for | x | ^ A; in fact, when x makes a positive circuit of x = «> , 
yi and yi are only known to be replaced by some pair y ,, yi of linearly nde- 
pendent solutions so that 

yi = ci yi + c 2 y 2 , yi = d x yi + <k yt, ci d* - c 2 di # 0. 

Thus it is evident that yi, y 2 undergo a linear transformation when x makes a 
positive circuit of x = « . According to the well-known facts concerning 
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such transformations, it will then be possible so to modify the choice of y x , y 2 
that we have one of the two cases: 


(9) yi = Piyi, y2 = Psy*; or ft-pijn, yi-piyi + y*- 

The second of these alternatives is to be regarded as exceptional (pi = Pa), 
and we suppose at present that the first alternative is before us. 

If we define Ai and A* by means of the equations 


Ai 


7= log pi, 


2irV / — 1 



thus leaving \ x and A* arbitrary up to additive integers, and then define 
/„ (*), />2 (z), (x). In (z) by means of the equations 


yi - x Xx I n (z), 


y 2 = (z), 


<I0) «**.(«>. ^ = = 

it is clear that the functions Jy (x) are single-valued as well as analytic in 
the vicinity of x = » . To see this one has merely to observe that **■ and 
x*' are respectively multiplied by p, and p, when x makes a positive circuit 
of z = oo . Furthermore the expression 

ln(z)l»(x)-t a (x)l ,,(*) = *-*-*•( y. yi - y* yi) = *"*"*' *~ J 

is not zero for | x| £ fi, since p (x) is analytic. Consequently these func¬ 
tions In (x), (x), fn(x), fn(x) satisfy all the conditions of the aux- 

iliary theorem. 

Now let a n (z), a 12 (z), a 2 i(z), Om(z), fn(z), (z), c 2 i (z), e 22 (z) 
be the functions and *,, *, the integers fulfilling the fundamental equations 

(3) of the auxiliary theorem. # 

The pair of functions hi (z) and / 2 , (z) depend on the initial choice of 
Aj. If we multiply the equations (3) whose left-hand members are l n (z) 
and / 2l (z) by z"*‘, these functions are replaced by z“*> hi (z) and z %i (z) 
respectively. But this pa^r of functions is the same as the pair hi (z) and 
/ 21 (z) that would be obtained if Ai were replaced by Ai + *i. A similar 
remark applies to the pair of functions /i 2 (z) and ln(x). That is, by 
properly choosing the four functions In (z), fit (z), hi (z) , ht (z) we may 
take ki and k 2 equal to zero, and obtain 


hi ( x) = a n (z) «n (z) + a l2 (z) e 2 i (z), 
hi (z) = an (z) e 12 (z) + <*n (x) e 22 (z), 
hi (z) = a 2 i (z) «n (z) + a 22 (z) e 2 i (z), 
la (z) = a u (z) e x2 (z) + a 22 (z) e 22 (z), 
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where the functions aq (z) and eq (z) have the properties stated in the 
auxiliary theorem. 

Let us make this choice of the functions a n (z), an (z), a*i (z)» a** (*) 
in the transformation (5). Multiplying the equations (11) for ln(x), 
fa ( X ) and In (x), kj (*) by z*» and z* 1 respectively, we obtain 


Vi - an (a:) + a„ (z) Z, 

z, = a,i (z) y, + a« (z) Z, 


« - 1 . 2 ). 


where 


( 13 ) yi = (*)» = ***«« (*)» = ***«« (*), = z* *,(z). 

Hence, if the transformation (5) be chosen in this way, ( Y x , Z x ) and ( Y 2 , Z 2 ) 
are a pair of linearly independent solutions of the transformed system (6) 
of the form (13) where e u (*)» «i* (*)» «« (*)» (*) are entire functions 

such that en (z) e M (z) — «it (*) e*i (*) is nowhere zero in the finite plane. 

It is now an easy matter to determine explicitly the form of the coefficients 
Pa (z) in (6). If we substitute the above forms (13) for the known solutions 
in the first equation (6) we obtain at once 


+ 7«i. (*) = Pi i (*>(*) + Pit (*) «»i (*>. 


-J— + t««* <*> = p » (*)«»<*> + p» (*)««(*) • 

ax x 


On account of the fact that the functions eq (z) are entire, and that 
en (z) *22 (x) “ *i* (*) e 2 i (x) is nowhere zero, it follows from these equa¬ 
tions that Pn (z) and Pj 2 (x) are single-valued and analytic everywhere in 
the finite plane except for a possible pole of the first order at z = 0; by using 
the second equation (6) we may likewise show that P 2 1 (z) and P 22 (x) 
have these same properties. But the functions Pq (z) are analytic at z = co 
according to the formulas (8), which hold inasmuch as an(z), a 12 (z), 
fl2l (i), a 22 (z) are analytic at z = » and reduce to 1,0, 0, 1 respectively 
a t x = co . Accordingly the functions Pq (z) are linear functions of 1 /z, 
and are given by the series (8) which break off at the term in 1 / z. There¬ 
fore the equations (6) have the simple form 

( 14 ) x lL =TZ ' = ,y+ (I+ 1 “ Pl)Z ' 


where r and 5 are certain constants to be studied later. These equations can 
be integrated by a quadrature in the two cases r = 0 and 5 = 0. 

Bearing in mind the nature of the transformation (5) that takes the original 
equations (4) into (14), and also that rZ and 2 may be replaced by 
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x (dYI dx) and x{dy I dx) respectively, we are led to summarize our result 
as follows: 

Theorem I. If y satisfies the equation 


f£+P<*)g+9(*>V- 0. 


P (*) 


-14-2! 


i + “+ 


/ \ 9* i I 

9(*)-?+5 + 


then y and x (dy / dx) can be represented in the forms 

(15) p-a„(*)y + «u(*)*^, **“°*i(*) y+a “ ( * ) fs 

where Y is some solution of 

(16) ^+(- 1 + 7)£-? y = 0 

and <!„(*), a„(x), a„(i), a« (*) arc analytic at x = «, reducing to 1, 
0,0,1 respectively at x = « . /nffo cajtf r = 0 it is necessary to replace 

xdY . .. x(fy 

- j- &y lim - -7- . 
r dx * r±»T dx 

The equation (16) is obtained from (14) under the hypothesis r + 0. 
When r approaches zero, it is possible to make the solution ( Y, Z) of (14) 
approach any particular solution of that equation for r = 0, i. e., one may write 

Z-limfg. 

r_o r dx 

A slight modification in the proof must be made in case the second alter¬ 
native (9) is before us. In this event we may define hi (*), ht (*), In (*), 
l n (x) so as to satisfy the restrictions of the auxiliary theorem by writing 

instead of (10) 

y, = **■/„ (*), y. = l0 * 1 + ll ' ,x) ) ’ 






From this point on we may develop the argument as we did before, except for 
a mere difference of analytical detail. 

The equation (16), which comes from the linear system (14) by an elimina¬ 
tion of Z, may be termed the canonical form of equation (1). . . 

It is clear that a study of the solutions of (1) in the vicinity of x = «, 

Tram. Am. Malb. Soc. 81 
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reduces at once to a study of the solutions of this simple equation (16) which 
involves only the known constant pi and the product rs . The point x = 0 
is a regular singular point of (16) and of (14) with exponents Xi and X 2 by 
(13). The indicial equation at x = 0 is 


X*+ (pi — 1)X — r* = 0, 


so that we have the relations 

^17) Xi + X 2 = 1 — pi, Xi X* « — rs . 

One may easily verify that these relations continue to hold even in the second 
alternative (9) when the multipliers are equal. 


§ 2. The Formal Solutions and the Exponents. 

The equation (1) admits of two formal solutions * of the following types 


This fact may be verified by a direct substitution which permits one to de¬ 
termine s^, sf}\ ••• and J}', J?\ ••• uniquely so as to make s x (x) and 
8j ( x ) formally satisfy (1). Moreover the canonical equation (16) is also of 
this type (1), and in this case an explicit determination of these formal series 
Si (x), St (*) gives at oncef 


_ . . /. XiX 2 1 . X, (Xi + 1) Xj (Xi + 1) 

Si (x) = ^1 r • -+-1T2 




(18) S 2 (x) = e‘x-»ll + 


( 


Xi — 1) (Xj —1) 1 
1 x 


+ 


(X t - 1)(X, — 2)(Xj— l)(Xi— 2) 1 
1*2 x 2 



Now, putting aside for the moment the case r = 0, we see that the formal 
solutions of the linear differential system (4) are given by 


<ki ( x) 

y = Si (x), z = x ^ , 


y = s* (x)f 


z = 


ds 2 {x) 

X dx 9 


• A discussion of such formal solutions is given by Horn : Gewohnliche DifferenlialgUichungen 
beliebiger Ordnung, pp. 183-188. 

f An easy way to obtain the second series is to substitute y = t x x"*> w in (6), and in the 
resulting transformed equation put u> = 2^ • The simplification of the form of the 
recurrent relation between the successive coefficients in both series depends on (17). 
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and that these solutions for (14) are 


Y = S i(x), 


xdS x (x). 




xdS 2 (x) 
r dx 


It is also clear that each of these sets for Y, Z goes into the corresponding 
set for y, z, by the transformation (5). In fact the formal solutions Y ,Z go 
into formal solutions y, z of the proper form with the same leading terms 
since the functions an (x), a^ (x), a 2 i (x), a 22 (x) are power series in negative 
powers of x with leading coefficients 1, 0, 0, 1 respectively. It is directly 
demonstrable that the form and leading terms characterize the formal series 
for y, z completely, and hence the correspondence must be as stated. 

Therefore if we replace y by s, (x) in (15) and )’ by S, (x) respectively 
for i = 1, 2 the resulting equations are formally correct. These relations 
permit us to determine power series for au(x), ai 2 (x)/r, a 2 i(x)/r, 
a 2 2 (•*■) / r by means of the known series for j, (x) to be obtained from (1), and 
the series S, (x) given in (18). 

These latter series depends on Xi and X 2 alone, so that we arc in a position 
to actually obtain the functions a n (x), ai 2 (x)/r, a 2l (x), a 22 (x)/r 
in (15) as soon as Xi and X 2 are known. That is, once a set of exponents 
Xi, X 2 are obtained it will be possible to determine the transformation (5) ex¬ 
plicitly. We observe that, since o 22 (x) reduces to 1 at x = « , the power scries 
for a 22 (x) / r begins with the term 1/ r, which permits us to determine r; 
and that when r is found, s may be determined by means of the relation 
rs = — Xi X 2 . 

Likewise even if r is zero the equations (14) again admit of similar formal 
solutions, and by substituting in both sides of (5) corresponding formal 
solutions of (4) and (14) one may determine a n (x), ai 2 (x), a 2 j (x), a 22 (x), 
and s . 

It is essential therefore to determine the possible values of Xi and X 2 , and 
to know in what cases r may be zero. Concerning these exponents we know 
at present that 


(19) Xi = — 


2rl / — 1 


log pi, X 2 = - 


2*V- 1 


log P2, Xl + Xj— l“Pl. 


Any pair of exponents Xi and X 2 may be replaced by Xi + m and X 2 — m 
respectively where m is any integer, without destroying the relations (19). 
The multipliers pi, p 2 may be computed in various ways and we shall suppose 
these to be given quantities. 

In order to completely consider all possibilities we need to make a sub¬ 
division into two cases, the first of which is the following: If neither of the 
multipliers pi, p 2 reduce to 1 (Case A) the exponents Xi, X 2 are unrestricted save 
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as indicated by (19) and neither r nor sis zero. In this case both formal solutions 
of (1) diverge. 

To prove this we note that there is a transformation of the form (5) 

(20) y — bn (x) ?+ (*) Z=b»(c)Y+b n (x)Z, 

where b n (x), b n (x), b u (x), bn (x) are analytic at i = » and reduce to 
1 0,0,1 respectively at x = » , that takes the equations (14) into like 
equations in which the exponents Xj, X* are changed to Xt - 1, X* + 1 re¬ 
spectively. In fact if we write 


( 21 ) 




these equations (14) go into 

d± m X, (1 ~ _Ai ) 2 I ^ = lli+A>y +(l+ i_p 1 )Z 
dx s dx Xi 

in which — rs = X 2 X 2 is replaced by — ri - (Xi — 1) (X* + 1). It is 
to be observed that r and s are not zero, for that implies that either Xi or X 2 
is zero by (17) so that one of the multipliers is 1. 

By compounding the transformation (5) with the transformation (21) we 
obtain another transformation of the same form as (5) taking (4) directly to 
(21). Hence we have increased one of the exponents by unity and decreased 
the other by unity. By a succession of such steps we can obtain an equation 
(14) in which the exponents Xi, X* are replaced by Xi -f m , X 2 — m where m 
is any integer. This proves our first statement above. 

Now the formal series (18) diverge unless they terminate, and they cannot 
terminate since neither Xi nor X 2 is an integer, by hypothesis. But it is obvious 
that corresponding series s and S for (1) and (16) either converge or diverge 
together by (5). It follows that the two formal solutions of (1) diverge in 
this case. 

The second case is the following: If one of the multipliers, as pi, reduces to 1 
(Case B ) it is possible to take Xi = 0 and X* = 1 — p\. At least one of the 
formal solutions Si(x), s 2 (x) of (1) converges. If s x (z) diverges, then r 
but not s reduces to zero ; if s 2 (x) diverges, s but not r reduces to zero', if both 
si(x) and s 2 (x) converge, we may take r - s = 0. 

In this case one of the exponents is zero at the outset, or by a succession 
of transformations (20) it will be possible to reduce an integral exponent corre¬ 
sponding to the multiplier 1 to the value zero so that Xi = 0; we shall have 
then X 2 = 1 — Pi, by (17). 
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Also since we have rs = — AiX* * 0, either r or s is zero. 
If r = 0, the system (14) becomes 

(22) , d / = 0, x d ^ = sY+(.x+l-p,)Z. 


The two formal solutions of this system are clearly 


(23) 


y-i. 

y = o. 



. Pi — 2 , Cpi — 2)(pi — 3) 



z = *•**-*■, 


the second of which is terminating and so convergent. If s = 0 the system 
(14) has the two formal solutions 


y=i, z = o. 


(24) 


[ 


r«*ar*i 1 + r + 


Pi , Pi(Pi + 1) 




Z = e x z 1- ** 1 , 


by (18), and here the first series is terminating and convergent. In either 
case one of the formal solutions of (14), and hence the corresponding formal 
solution of (1) converges. If si ( x ) diverges, so will the corresponding solution 
of (16) and we must have the first case r = 0. Since s appears as a factor in 
the first series (23) we infer s * 0 in this case. Likewise if (z) diverges 
we infer r + 0, 8 = 0. 

It only remains therefore to show that in the case when both Si (x) and 
s 2 (x) converge we may take r = s = 0. We have already seen that either 
r or s is zero. Suppose we have r = 0, 3 + 0. An inspection of the first formal 
solution (23) shows that pi must have one of the values 2,3, • • • in order that 
the series for Z may terminate and converge. The first solution may be written 
Y = 1 , Z = — sP (x)/ x, where P (z) is a polynomial in 1 /z. If we make 
the further transformation 

Y = Y, Z = - S -P(x)Y+Z, 


of the form ( 20 ) where in (i), 6.1 (*), b tl (x), b tt (x ) are analytic at i_= 0 
and reduce to 1,0, 0, 1 respectively at z = co , the resulting system in Y , Z 
13 

I S =0 - 

of the form (14) with r = s = 0. But the transformation (5) and the above 
transformation are together equivalent to a single transformation of the form 
(5), so that we may take r = s = 0. Likewise we may dispose of the case 
r= $r0,s=0bya consideration of the second of the solutions (24). 
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§ 3. Explicit Solution of the Canonical Form of Equation. 

In order to study completely the solutions of (1) in the vicinity of x — 
it is necessary to solve the canonical form (16) of equation equivalent to the 
linear system (14). If either multiplier p„ P. is 1, we have essentially Case « 
and we may reduce the equation (14) to a form in which r = 0 or s - U. in 
either case the reduced system (14) is integrable by a quadrature, and we have 

Y=c, Z= e'*>—[d +e»JV*i""* dx ] if r = 0, 
(25) y= d + crfe-z^dx, Z = W*- ifa=0. 


Therefore the canonical equations (14) can always be solved by a single quad- 
rature if either Xj or X* is an integer. 

However when neither Xi norX 2 is an integer it is possible to effect the solution 
of (16) by means of the Laplace transformation. To simplify the form of this 
transformation let us write Y = *»> P in (12) and obtain the equation* 


(26) 


+ (1 + Xi — Xj — x) — — Xi ? 


dx 


0 . 


This equation is of the simple type to which Laplace first applied his trans 
formation, and using his method we find the formulas 


(27 ) Y t =*' f e"** 1 ” 1 (1 — M)T**dz 

Jit 

and likewise by interchanging Xi and X 2 , we find 

( 28) Yt~^fe"^-'(l -*)“*■& 


(< - 1 , 2 ), 


(t - 1, 2). 


Here the contours h and h are loop-circuits to the points z = 0 and z = 1 
respectively taken from z = « along a ray to these points, making a circuit 
in a negative sense and returning to « along the ray. The ray must be so 
chosen that the integrals converge. 

Under the conditions that neither Xi nor X 2 is an integer the two solutions 
of either set are linearly independent. 

• Horn has considered the application of the Laplace transformation to the equation 


(a 0 + 6 *z)+ (ai + 6ix)^ + (a t + btx) y = 0, 

which he reduces first to a normal form somewhat different from our equation (26): Mathe- 
matische Annalen, vol. 49 (1907), pp. 453-492. The reader who desires to see how 
the integral given by the Laplace transformation may be made to yield complete information 
about the solution Y is referred to this interesting paper. 
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It is now possible to give more explicit form to the relation (15) of Theorem 
I which we do in the following: 

Theorem I'. Two linearly independent solutions of an equation 


^+p(*)2+?(*)y = o. 


?(*) = -i+“+—. 


q(x) 


?* + £ + 

z* + z 3 + 


may be represented in the form 

yt : = x*' £ a , : ( x ) e" ^ (1 - 2 )“* dz + 6 . ( z) «" 2 X| (1 


z)- A, <fe] 

« - 1 , 2 ), 


where a, (x) and 6 , (z) are analytic at x = co and reduce to l and 0 respectively 
at x = oo , and Xi, X* are any determinations of Xi, X 2 satisfying the equations 
(18). TAw representation breaks down only when one of the multipliers pi, p 2 
reduces to 1 , and then it may be replaced by one of the two following: 

y x = a(z) + b(x)e’x l - p ' f e^x^dx, y 2 = b (x) e* x'~*', 

yi - a(z)/ e‘x-*'dx+b (z) e* x *~*', y 2 - a(x). 

To prove these relations it is only necessary to write in Theorem I 
an (x) = a (x), a 12 (x) / r = b (x) and to use for Y and Z the forms de¬ 
duced from (28) and (25). 


§ 4. Characterization of the Laurent Series for the Solutions. 

Let us now restrict ourselves to the non-exceptional case when both of the 
multipliers p \, p 2 are distinct from each other and from 1. In this case the 
equation ( 1 ) has a pair of solutions 3/1 (z), y 2 (z) of the forms 

yi ( x) = z*Wn (z), y 2 (z) = z*« hi (*) > 
where In (z) and h 2 (z) may be expressed as Laurent series 

/ll(x) = z A”*’. il,(*)= £ 4’ 1 *’. 

since these functions are single-valued and analytic for | z| ^ R. 

It is an interesting matter to determine the precise character of the coef¬ 
ficients t? in these series. To do so we make use of the relations obtained 
from the transformation (5) taking the linear system (4) to the canonical 
system (14). The first equation of transformation may be written 
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where F satisfies (14) and a (x) and b (x) are analytic at x = » and reduce to 
land 0 respectively at x = • . Let us express a (x) and b (x) in the form 
of power series in negative powers of x 

(30) »(*> = 1 + |“'X"' b(x) = ±b,x~’. 

It is clear that the special solutions F, and Y, of (16) which correspond to the 
exponents X, and X, go over into the solutions y, and y, respectively, so that 

we may write 

yt = a (x) Fi + xb (x) (i-i, 2). 

Rut the functions x - *' F,. x - *' F, admit of expansion in Laurent series in x 
convergent for all finite values of x, since (16) has no finite singular points except 
the regular point x = O with exponents X,, X,. 

If we write F = x“ F and substitute 

v-0 

in (26) we find that 

/ At , Af(At-H) 

y » - + i.(Xi-A*+l) 1 + i-2 (Xi-Xt+l)• (Xi-X,+2) ) 


(30) 


/ X 2 , Xr(X 2 +U_ ~2 _i_ ,..\ 

F, - **^1 + I.(X 1 -X,+Ij + l-2 (X,-X 1 +l)-(X l -X,+ 2) ) ' 

Putting in the explicit series for y it a (*), 6 (x), Y, in (29) and comparing 

coefficients we obtain the following: 

Theorem II. If the multipliers p ,, p, are distinct from each other and from 
1, and if Xi, X 2 are any pair of exponents satisfying the conditions (19), the coef¬ 
ficients Iff' in the Laurent series 

yi = x* 1 Z 


in which two particular linearly independent solutions yi, yi of (1) may be ex- 
ponded, have the form 

f;' = K rl + (ai+ ti (X.+ >• +1) ]L5 Ftl> + [ai+ (Xi+ p + 2)] L:' +!) + 
where 
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(32) 


(o __ Xi • (Xi 4- 1) • • • (Xi + v — 1) _ 

1 1 * 2 • • • v • (Xi — X 2 -f- 1) * (Xi — X 2 + 2) • • • (Xi — X 2 + v j 

<„ __ X 2 « (X 2 + 1) • • • (X 2 + v — 1) _ 

1 • 2 • • • v • (X 2 — Xi+ 1) • (X 2 — Xj-|-2) ••• (X 2 — X, + v) 


and a ¥ , b ¥ are a set of numbers such that \/a ¥ and j 7b ¥ are limited in absolute 
value for v - 1,2, 

The condition on a ¥ and b ¥ stated in this theorem is equivalent to saying that 
the series for a (x) and b (x) converge for | x| sufficiently large, according 
to a well-known test due to Cauchy. The form for l'** given in the theorem 
is sufficient to describe these coefficients completely. 

It is not difficult to show that a similar characterization may be made in 
the special cases not coming under the hypothesis of the theorem. 


§ 5. Functional Relations. 

The theorems which have been developed in the preceding paragraphs are 
based on a preliminary normalizing transformation of the dependent variable 
in (1). This disposes of all but two essentially arbitrary constants. The 
transformation of independent variable x = t -f ^ (£), where ^ (£) is analytic 
at £ = co , leaves (1) unaltered in form but leads to no additional normaliza¬ 
tion. This fact is expressed in the following: 

Theorem III. If the multipliers pi, p 2 are distinct from each other and from 
1, and if the function <p (x) has the form x+ \l/(x) where \p (x) is analytic 
at x = co , then we have for every solution Y (x) of (16) 

(33) Y (*(*))- a (*)y ( x) + 6 (t)-jf- 

where a (x) and b (x) are analytic at x = co . 

The proof is immediate. Such a transformation applied to (16) takes 
that equation into an equation of the form (1) for which p 1# p 2 are the 
same as for (16) and whose formal solutions clearly diverge (Case A) with 
the corresponding formal solutions of (16), inasmuch as the new formal solu¬ 
tions result from the original formal solutions by this same transformation 
x= v(x). Consequently according to the results of § 2 it is possible to 
take the canonical equation associated with this transformed equation as in 
Theorem I to be the same as the equation (16) in 1' itself. 

If now we replace y , a n (x ), and xa l2 (x) / r by Y (<p (x)), a (x ), and 
b (x) respectively in (15) we obtain the equation (33) of the above theorem, 
where a (x) reduces to 1 at x = ao . Moreover the Y on both sides of equa- 
t on (33) refers always to one and the same solution, except for a constant 
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multiplier; this is a consequence of the fact that Y (ip (x)) and Y^(x) 
undergo similar transformations when x makes a positive circuit of x - °° • 
Such a constant multiplier may be absorbed nto a (x) and 6 (*) so that the 
modified equation (33) has the desired form. 

It is manifest that simple analogues of Theorems II and III exist in t e 
exceptional case when the multipliers become equal or reduce to unity 
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THEOREM CONCERNING THE SINGULAR POINTS OF 
ORDINARY LINEAR DIFFERENTIAL EQUATIONS 

By George D. Birkhoff 

DEPARTMENT OF MATHEMATICS. HARVARD UNIVERSITY 
Preacoud to tbo Acodemr. October IS. |9I5 

In earlier papers I have considered the effect of a linear transfor¬ 
mation of dependent variables upon the solutions of ordinary linear dif¬ 
ferential equations in the vicinity of a singular point. 

This type of transformation led me to the notion of equivalence which 
is fundamentally important for the classification of singular points. 

Ordinary linear differential equations also preserve their form under 
an arbitrary transformation of the independent variable. I shall prove 
here that this second type has no additional significance for the purposes 
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of classification since the solutions arc carried over into the solutions 
of equivalent differential equations by such transformations. 1 The 
precise statement of the result is contained in the following 

Theorem. Let Y (*) be a matrix 3 offunctions y u (x) (t,j = 1,2, • ■ ■, 
each analytic for \ x\^r, with determinant Y (x) not zero for \ x | ^ r. 
A necessary and sufficient condition that Y (x) forms the matrix solution 
of a linear differential system 


dY 

dx 


A (x) Y, 


where the elements a {j (x) of A (x) are analytic (q = 0) or have a pole of at 
most the qth order (q > 0) at x = js that, for every function * (x) 
of the form 

f> (x) = x +1 x~* + m x ~*~ l H-, 


the matrix M (x) defined by the matrix equation 

Y (<? (x)) = M (x) Y (x) 
is composed of elements m {j (x) analytic at x= ®. 

Let us prove first that the condition is necessary. 

The function M (x) defined for any choice of *ix) is single-valued as 
well as analytic for |x| sufficiently large. In fact, when x makes a cir¬ 
cuit of x= ®, Y{x) is altered to F(x)C where C is a matrix of constants 
of determinant not zero. But since *>(x) - x remains finite as x becomes 
infinite, v> (*) will make essentially the same circuit if |x| be sufliciently 
great. Thus Y(*p(x)) changes to F(y>(x) )C, and Mix) is unaltered in 
value. 

Now the expression for Mix) 


Y (x 0 ) Y~ x (xj (x 0 = (*); *m - x) 


may also be written as a matrix product 

[Y (* 0 ) Y~' (*,)] [Y (x.) Y" (*,)] • •• [Y (x„_.) Y - 1 (*.))• 

Here we will assume that the values x#, x t , x*, .... Xm of x are 
equally spaced, each after the first differing from its predecessor by 
Ax= (xj Xm)/m. 

From the differential equation itself, it is clear that 
Y (x.) - Y (x 1+1 ) = !ix[A (x,-) -f £,] Y (x i+I ) (i = 0 zero, 1, • • •, tn - 1) 
where E% will have arbitrarily small elements if Ax is small enough. 
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m 

For \x\ taken large enough, it is also clear that wc will have an equality 
of the form 

I <*</(*) I <<* I** I (fj, - 1, 2 

for the elements of A(x). Hence if / denotes the matrix with elements 
6ij(6ij = 0, 5«=1), and if A denotes the matrix whose elements arc 

all equal to the positive constant a, we find that the elements of the 
matrix 

Y ( x { ) Y~ l (x (+l ) or I + Ax(A (*,)+£,) 
are less in absolute value than the corresponding elements of the matrix 

/ + 21 Lx \A | tff | f . 

Also on account of the form of <p(x) we have for |x| large enough 
I *o - x m | ^ k | x, | “ f , (A independent of i or x). 

and thence 

tn 

Substituting this value of Ax above, we conclude that the elements 
of Y{xi) Y~Hxi+\) are less in absolute value than the corresponding 
elements of 



A. 


m 


Thus, by the product formula, M(x) will have elements whose absolute 
value is less than that of the corresponding elements of 



whose elements are obviously limited in absolute value for all values of m. 

Consequently the elements of M(x) are limited in absolute value for 
U'l large enough, and are nccessaiily analytic at infinity. 

Let us show now that the stated condition is sufficient. 

To this end we will consider the family of functions 

v (x) =x + px~ q 

where p is a parameter. 

Defining M(x,p) by the equation 

Y(x + px-')=M(x,p)Y(x), 
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we observe that M{x,p) is made up of elements single-valued and ana 
lytic in x and p if 

|p|gl. 


inasmuch as in this event 

Moreover, in view of the stated condition, the elements imj(x,p) 
of M(x,p) are analytic in x at x = ®. 

Consider now the integral formula 


ifftyfop) 


—i r (t> p) jj 
2tv/-1 J *({“*) 


(|x|>l+r) 


where the integration is performed around the lixcd circle C: |{|- H r 
in a positive sense. This reduces to the Cauchy integral formula by 
the substitution x=\/x\ 1=1/?, which holds since m(x,p) is analytic 
for x^l +r and at x = ®. 

We now perceive from the form of the integrand that mu ( x,p) must 
be analytic in x and p for x = ®. 

Differentiating the equation of definition for M(x,p) with respect 
to p we obtain 

Y(x + px - *) -±M(x .,) K(*). 

dx dp 


The elements of dM(x,p)/dp are also analytic in x and p at .r = ®. Let¬ 
ting p approach zero we obtain, for |x| > 1 +r. 


dx dp 


which establishes at once that Y(x) is the solution of a differential 
system of the desired form. 

1 This result was partly known to me in 1908. For a special case, sec Trans. A mer. .\f olh. 
Soc., •*. 462-476 (1913), in particular pp. 475-476. 

1 For the elements of the theory of matrices here used see Schlesin^er, VorlesungtH uber 
linearen Diferenlialgleichungen, pp. 18-19. 
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INFINITE PRODUCTS OF ANALYTIC MATRICES 

BT 

GEORGE D. BIRKHOFF 

In a large part of the theory of functions of a single complex variable the 
matrix of analytic functions rather than the single analytic function must be 
taken as the fundamental element. This is certainly the case for the functions 
defined by linear difference and differential equations. 

The goal of the present paper is to show that the classical results of Weier- 
strass and Mittag-Leffler, treating of the formation of infinite products of 
functions with assigned singularities, admits of a natural extension to infinite 
products of matrices. The matrices considered will be square matrices of 
n 2 elements and of determinant not identically zero. 

The concept of equivalence, which I have developed elsewhere,! lies at 
the basis of this extension: Let 

A(x) - Ia, 7 (*)J and B(x) - [&</(*)! (»,> - I, •••, n), 
be two matrices of analytic functions, each analytic in the neighborhood of 
a point x - x 0 but not necessarily analytic at the point x 0 . If the matrix 
M (x) defined by the matrix equation 

A(x) = M(x)B(x) 

is composed of elements m,/(x) each analytic at x - x 0 and if the determinant 
13/ (ar) | of the matrix M (x) is not zero at x 0l then A (x) is equivalent to 
H(z)ati = z 0 (from the left).* 

From this definition it follows immediately that if A(x) is equivalent to 
B(x) at x = Xq , then B (x) is also equivalent to A(x ); if, further, B{x) 
is equivalent to C(x), then A (x) is equivalent to C(x). 

Two matrices equivalent to each other at a point have essentially the same 
type of singularity at the point. It is this fact which gives importance to the 
notion. 

Evidently any two matrices of functions analytic at x 0 are equivalent if the 
determinants of the two matrices do not vanish at the point. For this reason 
a point for w hich the elements of a matrix are analytic, and the determinant 

• Presented to the Society, December 28, 1915. 

f See, for example, Proceedings of the American Academy of Arts 
and Sciences, vol. 49 (1913), pp. 521-568; in particular p. 540. 

J For the elements of the theory of matrices assumed in the present paper 6ec Schlesinger, 
Vorlcsungen uber Uneaten Diflereniialglcichungen, pp. 1S-19. 
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of the matrix is not zero, will be called an ordinary point of the matrix. If 
the elements of the matrix are analytic, while the determinant vanishes, the 
matrix will be said to have a singular point of analytic type at x = x 0 ; if the 
determinant vanishes to the Arth order, this singular point will be said to be 
of multiplicity k. 

If the elements have poles of the kth order at most at x = x 0 , the matrix 
will be said to have a pole of the kth order ; at such a point the determinant of 
the matrix will be analytic or have a pole of order not greater than nk . 

Finally, if the elements are analytic in the neighborhood of a given point 
while some elements have an essential singularity at the point, and if the 
determinant of the matrix does not vanish in the neighborhood of the point, 
the matrix will be said to have an isolated essentially singular point there; if, 
however, the determinant does vanish in every neighborhood of the point, 
or if the point under consideration is a cluster point of singular points of the 
matrix not of analytic type, the matrix will be said to have a non-isolated 
essentially singular point there. 

It is to be recalled that in the theory of the factorization of an analytic 
function a cluster point of zeros does not play the r6le of an isolated essentially 
singular point. 


1. Factorization of polynomial matrices 


We will commence with the consideration of matrices whose elements are 
polynomials. By the order of such a matrix we shall mean the degree of its 
polynomial determinant in x. 

From the standpoint of factorization the simplest type is that of order zero 
(constant determinant). The inverse of such a matrix and the product of 
two such are also of order zero. These matrices are not singular in the finite 
plane.* 

A matrix of polynomials will have a single singular point of analytic type 
in the finite plane if its determinant is of the form c(x — x 0 ) k . This point 
will lie at x = x 0 and be of multiplicity k . 

An elementary matrix will be a matrix of the form 


( 1 ) 


U(x) = 


1 

0 

Cl 

0 

0 

1 ••• 

Ci 

0 

0 

0 ... 

• • • • 

(* - x 0 ) ••• 

• 

0 

0 

0 ... 

c* 

1- 


• Questions of factorization for such matrices have been considered by Hensel and Lands- 
berg, Theorie der algebraischen Funklioncn einer Variabcln, etc., p. 163. 
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and differs from the unit matrix only in some kth column where the successive 
elements are the following: 

Ci, c t , • • •, Cfc-i, x - x 0 , Ck+i , •••, Cn. 

Its only finite singular point will be one of analytic type and multiplicity 1 at 
x = x 0 . The ifcth column of the matrix U (x) will be termed its character¬ 
istic column. 

Lemma. The most general matrix F(x) with a singular point of analytic 
type at x - x 0 of multiplicity k may be decomposed into the product of a matrix 
Q(x), for which x 0 is an ordinary point if k = 1 or a singular point of analytic 
type and of multiplicity k — 1 if k > 1, and a second elementary matrix U (x) 
with singular point at x 0 , i. e., 

(2) F(x) - Q(x)U(x). 

Proof.— The determinant of F(x 0 ) is zero by hypothesis and thus we may 
determine a matrix of constants D of determinant not zero so that the product 
matrix F (*<>) D has the elements of some /th column all zero. To effect this 
it is necessary merely to determine the elements du,du, • • •, (Li (not all zero) 
of that column of D so that the n equations 

/.i ( xo)du + /,-*(a'o)dit + ••• “ 0 (i -1,2, •••,!») 

hold, and then to choose the remaining elements of D so that its deteminant 
does not vanish. But this set of n linear homogeneous equations in du, 
fat, • • •, dni can always be satisfied since the determinant of the coefficients 
is |F(*o)|- 0. 

Furthermore let us assume for the moment that the first of these quantities 
du is not zero and let us take / * 1. In this case the elements of D occurring 
in the n — 1 columns after the first may be taken to be 1 along the principal 
diagonal and zero elsewhere. Also the elements of the first column may be 
taken to be 1, rf*, da, • • •, d n so that 

1 0 ••• O' 

D = di 1 * * * 0 • 

d n 0 ••• 1. 

When D has been taken in this way, the matrix F (x) D has the property that 
the elements of the first column vanish at x = x 0 . In consequence, if we 
write 



x — Xo 0 • • • 

0 

F(z)Z> = <?(*) 

0 

• • 

1 ••• 

• • • 

0 

• 


. 0 

0 • • • 

1. 
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so that Q(x ) is derived from F(x)D by dividing the elements of the first 
column of F(x)D by x - x 0 , the matrix Q(x) is made up of elements 
analytic at x 0 . 

Moreover, if k = 1, the determinant of Q(x) is not zero at x = x 0 ; this 
follows from the determinant relation | F(x )| = | Q (a:) | (z — a- 0 ). Likewise, 
if k > 1, the same relation shows that Q(x) has a singular point of analytic 
type of multiplicity k — 1. 

If we multiply the above matrix equation through by D~ l on the right, we 
obtain 




0 

... o 

F(x) = <?(*) 

~dt 

1 

... o 



0 

... 0 . 


The second factor is evidently an elementary matrix U (x) while the first 
factor Q(x) has the properties demanded of it in the lemma. The first 
column of U(x) is the characteristic column. 

In the case when du is zero some one of the quantities du, • • •, d ni will be 
different from zero. If du is not zero, for example, we may take l = 2 and 
factor F(x) into the product of a matrix Q(x) and an elementary matrix 
U(x) whose second column is its characteristic column. In general if the 
Xth of the quantities du, • • •, d nl is not zero we may take l = X and proceed 
as before. 

This completes our proof of the lemma. 

It is evident, in the simple case when F(x) has a singular point of multi¬ 
plicity one at x = x 0 , that there may be n distinct ways of separating out an 
elementary factor, namely by allowing l to range from 1 to n. Hence the 
theory of factorization for n > 1 diverges greatly from that for the case 
n = 1 inasmuch as the mode of factorization is not unique. 

The matrix Q(x) of the lemma has the same singular points in the finite 
plane for x * .r 0 as ?(*). Furthermore such a singularity is of the same 
kind for both matrices, i. e., is of analytic type with the same multiplicity, or 
a pole of order k, or an isolated essential singularity, or a non-isolated essential 
singularity for both. This is an obvious consequence of (2) and the inverse 
relation Q(x) = F(x) U~ l (x ). 

We are now in a position to establish at once the following theorem: 

Theorem I. An arbitrary matrix of polynomials P(x) of order m whose 
determinant is factorable into the product 


c(x - *i)(x - Xi ) ••• (x - X m ) (c * 0) 

may be expressed as a product 
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Po(x)U\(x)Ut(x) ••• Um(x) 

where Po(x) is a polynomial matrix of order zero and where U t (x), U 2 (x), 
••• , U m (x ) are elementary matrices with singular points at x Xt x 2 , •••,**, 
respectively. « 

It is obvious that a successive application of the above lemma enables us 
to write P (x) in the stated form. 

Suppose now that a polynomial matrix P(x) be expressed in any way as a 
product V x (x)V 2 {x) ••• V m (x) of m polynomial matrices V\(x), V 2 (x), 

• • •, V m (x), each having its only finite singularity at distinct points x lt x t , 

• • •, x m respectively. A particular product of this sort can be obtained at 
once from the product for P(x) of Theorem I. 

The most general factored form V l (x)V i (x) ••• V m (x) of P(x) with the 
same properties is defined by the equations 

V x {x) - V l (x)R l (x), 

V 2 (x) = Rr , (x)V t (x)R t (x) t 
• ••••• 

V m (x) - R-L l (x)V m (x), 

where R\{x), • • •, R^i(x) are arbitrary polynomial matrices of order zero. 

It is clear that the indicated_general factored form is a possible form. 
Further, any factorization Vx(x)V t (x) ••• V m (x) leads at once to the 
matrix identity 

V 2 (x) ... V m (x)V~'{x) ... Ff*(x) - VT'(z)Vi(x). 

The matrix R x (x), as given by the right-hand member, has its only possible 
finite singularity at X\, while, as given by the left-hand member, it has its 
only possible singularities at x 2 , • • • , x m . Hence Ri(x) is not singular in the 
finite plane and must be composed of polynomial elements. This gives us 
the first equation. 

By use of the relation just obtained we find 

V 2 (x) ... V m (x) = R l (x)V t (x) ... V m (x) t 
and from this we get further 

VAX) ••• V m {x)V?(x) ... Vr'(x) = VT'(x)RAx)VAx). 

The matrix given by either side of this equation must, for a similar reason, 
be a polynomial matrix R 2 (x) of order zero. This conclusion leads us at 
once to the second of the given equations. 

Continuing in this way we derive all of the desired equations in succession. 
It is interesting to consider a particular resolution in elementary factors 
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possible in general when the elements of P(x ) are polynomials of the /ith 
degree and when |P(x)| is a polynomial of degree n\x = m with distinct 
roots X\ , x<t , • • •, Xm . 

If we remove an elementary factor corresponding to x = x m for which the 
first column is the characteristic column, we thereby only alter the first column 
of P(x). In fact we obtain the quotient Q(x) of the lemma from P(x) 
by multiplying on the right by U~ l (x), which is equivalent to adding a con¬ 
stant multiple of the second, third, • • •, last column to the first column, and 
dividing that column by x — x„. Thus the degree of the first column is 
lowered by unity, while the other columns are unaltered. 

In the same way we may lower the degree of the second column by unity, 
by the removal of an elementary factor corresponding to x = x«_i for which 
the second column is the characteristic column. 

By this process, after n steps, we lower the degree of all n columns of 
P(x) by unity by removing n elementary factors. 

Repeating the process we can reduce the quotient to a constant matrix in 
ny steps. Every step can be carried out in the general case, i. e., if certain 
particular algebraic relations between the coefficients of P(x) do not hold. 

Thus the matrix P 0 (x) of Theorem I of order zero can be taken to be a 
matrix of constants in this case. 

This result shows that one can not expect to obtain a simpler normal form 
for a factor with one singular point than the elementary matrix. For we have 
used the ny roots h , • • •, x„, then»i elementary factors each involving n — 1 
arbitrary constants, and a constant matrix with n 2 constants. Thus there are 
involved n 2 (y + 1) explicit constants in the factorization of P(x ), precisely 
as many as there are in the polynomial matrix P(x). No further simpli¬ 
fication can be expected since no superfluous constants have been employed. 

2. Equivalence problem for polynomial matrices 

In the first paragraph we solved the simple problem of factorization of 
polynomial matrices. The converse problem of constructing a polynomial 
matrix with given factors is solved merely by writing down the product of the 
factors. 

But it is possible to modify the point of view slightly. Every polynomial 
matrix with singularities at the distinct points x lt • • •, z* is by the lemma of 
§ 1 equivalent to a polynomial matrix F,(x) at the point x, whose only finite 
singularity lies at x< for t = 1,2, • • •, m . 

We are thus led to formulate a converse problem which may be stated as 
follows: 

Equivalence problem. Given polynomial matrices Fi(x), •••, F m (x), 
each having its only finite singular paint at the distinct points Xi, • • •, x m re- 
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spectively; to construct a polynomial matrix P(z) equivalent to these matrices 
at Xi, • • •, x m respectively but with no finite singular points other than x x , • • • , x m . 

Before proceeding to its solution we may note that the formulation of the 
problem is symmetrical with respect to all the singular points involved. 

An explicit solution may be obtained by the aid of a simple notation. Let 
[7*(x)J| denote a polynomial matrix, with its only finite singularity at £, 
to which the polynomial matrix T(x) is equivalent at the singular point 
x = £ ot T(x). The lemma of factorization of § 1 shows that, since T(x) 
is composed of elements analytic at x = £, a matrix l T ( x )) ( given as a product 
of elementary matrices, will always exist. If now we write for brevity 

Px(x) = [F,<*))„ = V l (x), 

Pt(x) - [V t (z)PT'(x)U, 
#•••••• 

P m (x) - [V m (x)PT'{x) ••• PZ !,(*)],., 

P(x) - P„(x)P,*_,(x) •.• Pi(x). 

To prove that P(x), as defined in this manner, has the desired properties 
we begin by observing that P(x) is certainly equivalent to V\{») at * - x x 
since the product P m (x) • •• Pj(x) is not singular atx«X|, 

Next let us proceed to the point x*. By definition of P 2 (x) we may write 

P,(x) - A(x)V t (x)PT'(x) 

where A{x) is not singular at x = x,. Consequently, on substituting in 
this value for Pj(x), we see that 

P(x) - P„(x) ... Pz(x)A(x)V t (x). 

But Pm(x), Pa(x) and A(x ) are not singular at x = x 2 so that P(x) 

is evidently equivalent to r*(x) at x = x 2 . 

In precisely the same way we proceed with the proof that P(x) is equiv¬ 
alent to V)(x), V m (x) at x lf • x m respectively, and thus forms a 
solution. 

The most general solution P(x) of the equivalence problem is P 0 (x)P(x) 
where Po(x) is any polynomial matrix of order zero. 

In the first place it is clear that P (x) as so defined is a solution of the equiv¬ 
alence problem. Moreover, consider any solution P(x). At any singular 
point as x x we have 

P(x) = A(x)V x (.x) t 
P(x) = B{x)V x {x), 

where A (x) and B(x) are not singular at x = x\. Using these values of 


(3) 

a solution is 

(4) 
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P and P we conclude that the matrix P ( x) [ P (x) ]~ x is equal to B ( x ) A -1 (x) 
and thus is not singular at x x . Similarly this matrix is not singular at x 2 , 
• • •, x m and also of course at any other point of the finite x-plane. Since 
P(x ) by hypothesis is a polynomial matrix the matrix Po(x) under consider¬ 
ation must be a polynomial matrix of order zero, which we were to prove. 


3. Factorization of matrices of entire functions. First method 

Let E{x) be a matrix of entire functions. Its finite singularities are all of 
analytic type, and fall at the points Xi, x±, • • •, at which the determinant of 
E(x ) vanishes. We take these points to be arranged in some arbitrary order 
and assume that they are infinitely many in number. It is clear that we have 
lim x m - » . 

Suppose that V x (x) is some polynomial matrix with unique finite singularity 
at Xi, to which E(x) is equivalent at x»; for instance V x (x) may be a product 
of elementary matrices with singularity at x x , as follows from the lemma of 
§ 1. The matrix E(x) Ff 1 (x) is a matrix of entire functions without a sin¬ 
gularity at x = X! but with its other finite singularities at x 2 , x 3 , • • •. At x 2 
this modified matrix is equivalent to a polynomial matrix F 2 (x) with unique 
finite singularity at x*. Forming now the matrix of entire functions 
E(x) V~ l (x) V ~ 1 (x), which is not singular at Xi and x 2 , we are led to K a (x), 
F 4 (x), ••• in similar fashion. 

It is therefore possible to find such polynomial matrices Fi(x), V 2 (x), 
••• that E(x) is equivalent to Fi(x) atxi.to F 2 (x)Fi(x) atx 2 ,and so on. 
The following theorem shows that matrices E(x) of entire functions having 
this property exist for any choice of V x (x), V t (x), ••• with a single finite 
singularity of analytic type at x lt x 2 , • • • respectively. 

Theorem II. If Fi(x), F 2 (x), ••• are 'polynomial matrices each with a 
single finite singularity of analytic type at the distinct points x x , x 2 , • • • and if 
lim x„ = « , there exists a matrix of entire functions equivalent to the product 
F m (x) ••• Fi(x)afx m (m = 1,2, •••) and of the form 

(5) E(x) = lim Q m (x) F„ (x) ••• F,(x), 

where Q m (x) (m = 1, 2, •••) is a matrix of entire functions without singu¬ 
larities in the finite plane. 

Proof. Let us first attempt to choose Q ,(x), Q 2 (x), ••• so that the 
sequence of matrices 

^(3r) = ^(i)F m (x)...F 1 (x) 

converges to a limit matrix as m approaches infinity. Now we have clearly 

(6) iW*) = [7+ &,(*)][/ + S^,(*)J... [/ + «,(*)]«,(*), 
where I + S„(x) = Q„+i(x)V m+l (x)Q-' (x). 
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It will therefore suffice to establish that the matrices S m (x) can be made 
to have elements of as small absolute value as desired within a circle of radius 
less than x m with center at the origin. This will be established by means of 
the following lemma: 

Lemma. If F ( x ) be a matrix without singularity in some closed continuum C 
in the finite plane, it is possible to find a matrix F X (x) without singularity in the 
finite plane and such that the absolute ralues of the elements of the difference 
matrix F(x) — F x (x) are arbitrarily small throughout C. 

A very simple proof of the lemma may be given as follows. The elements 
of the matrix 


dF(x) 

dx 


F-'(x) = A(x) 


are not singular in C . The elements a,/ ( x ) of A ( x ) may then be represented 
by series of polynomials which converge uniformly throughout C, by a well- 
known theorem due to Runge.* Hence we may break off the terms of these 
series at such a place that the elements of the resulting matrix A\(x) differ 
from those of A (*) by a uniformly small quantity throughout C. 

Consider now the matrix differential equation 

* F l(*) i (l \F(v\ 

~ -j—- - i4i(*)ri(x). 


By comparison with the equation for F, 


dF(x) 

dx 


A{x)F(z), 


we infer that a solution F x (x) of the preceding equation may be found whose 
elements are analytic and differ from those of F (x) by as little as we please 
throughout the given continuum C. 

But the differential equation for F x is without finite singular points so that 
|/ , i(:r)| nowhere vanishes.f Hence the matrix F\(x) has the desired 
properties. 

Let us now return to the proof of the theorem. Suppose that the matrices 
Qi, • • • ,Q m have been so selected that the matrices S it • • •, S^-i determined 
by this choice have their elements in absolute value less than assigned positive 
quantities a\, • • •, Om-i respectively within given circles of radii less than 
| x* | , • • •, | Xm | . We shall now show that Q^+\ can then be so selected that 
the corresponding matrix S m has its elements less in absolute value than a 
prescribed positive quantity a*, within any given circle of radius less than 
| Jm+l|- _ 

•Acta Mathematics, vol. 6 (1884), p. 229. 

t Schlesinger, loc. cit., p. 23. 
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To this end we observe that the matrix 

Q^x)VzU(x) 

will not be singular except at x = Xm+i- Thus, by the lemma above, a 
matrix Q*+i(x) without finite singularities may be chosen so that the dif¬ 
ference matrix 

«■*!(*) “ QmWV-UM 

has elements of as small absolute value as desired for |x| ^ d m < |*m+i|- 
Hence the same is true of 

S m (x) = Q^ l (x)V m+l (x)Qz l (x) - 1, 

as we see by multiplying the preceding expression on the right by Vm+\(x)Q^' (x). 

Thus we have a method of assigning Qm+i(x) as soon as Q m (x) has been 
chosen, in such a way that the elements of S m are less than a positive constant 
a*, for \x\< dm < |awi|» where ai + a* + • • • is a convergent series. More¬ 
over, since lim x„ = » we may take lim (L, = *> also. 

But the elements of the infinite sequence of matrix products 

IUx(x)QT'(x) - [/ + «.(»)] (/ + S^ 1 («)]... I/ + Si<*)) 

(m - 1,2, •••) 

will certainly converge absolutely and uniformly to the elements of a limit 
matrix if the elements of S m (x) are less in absolute value than the mth term 
of a convergent series of positive terms.* Furthermore the determinant of 
this limit matrix is the limit of the product of the determinants of the factors 
and the sum of the logarithms of these factors will form a convergent series. 
Hence we infer that lim Rm(x) is a matrix of entire functions E(x) of the 
form (5) and of determinant not zero save at x\, xt , • • • at which points E (x) 
is equivalent to Vi(x), Vt(x)Vi(x) , ••• respectively, so that E(x) has 
the desired properties. 

The most general matrix E{x) of entire functions icith its singularities at 
Xi, X 2 , • • • and equivalent to V\ (x ), V%{x)Vi(x), • • • at these points respec¬ 
tively has the form Q(x) E (x) where Q(x) is a matrix of entire functions with 
no finite singular point. This fact is immediately evident. 

In the case n = 1 the matrix Q m (z),as determined by the method of the 
lemma, is merely an exponential with an exponent which is polynomial in x. 
Thus we have a thoroughgoing generalization of the factorization theorem in 
the case n = 1, but one which appears much less explicit. Furthermore, the 
determination of each convergence factor is made to depend on the 

previous factor Q m (x). 

• The proof for n > 1 is not essentially more difficult than in the familiar case n = 1. 
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4. Factorization of matrices of entire functions. Second method 

A more explicit form of factorization can be given by the aid of an extended 
elementary matrix 

10 - WrJ 

0 1 - c ’(rJ. 

(7) U(x) =. 

0 0 • • • ( 1 - - 
\ T 0 


0 o ... [„(JJ] 

where 

( 8 ) Pr(z) - x • •• +J. 

This matrix differs from the unit matrix only in some /th column and reduces 
essentially to an elementary matrix for r ■■ 0 (compare with (1)). 

It is apparent that for a given value of r the lemma of § 1 may be extended 
to this type of matrix; i. e., a matrix F (x) with singularity of analytic type 
at x = To of multiplicity k may be written as Q(x) U (x) where Q has its 
elements analytic at x 0 , and has a singularity of analytic type of order k - 1 
at * - t 0 if k > 1, or is not singular at if k - 1. 

In fact, if we multiply F(x) on the right by a matrix 




which takes the same form as D of § 1 for x = x 0 and in which the /th column 
contains the d's, we may reduce the elements of the /th column of F(x) to 
zero at x = x 0 (for some /). Consequently if we write 
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0 ••• 

0 

0 

1 

0 

• 

F(x)D(x) =Q(x ) ^ 

0 ••• 

• • • 

e»**> ( 1 -- 
\ x 0 

0 

0 ••• 

0 



the elements of Q (x) have the desired properties, since Q (x) is derived from 
F(x)D(x) merely by dividing the /th column by 


£*<«/*0» 



Multiplying through the above equation by the inverse of D(x) we have 
F(x) expressed as the product of Q(x) and an extended elementary matrix 
V (x) like (7) with Ci = — d x , c* =* — d*, • • •, c„ — — d* . 

It is to be especially observed that the choice of the quantities d, and ac¬ 
cordingly Ci is independent of the choice of r which does not enter explicitly 
into D(x) forx - ar 0 . 

We are now prepared to demonstrate immediately the following second 
factorization theorem for matrices of entire functions: 

Theorem III. Any matrix of entire functions E(x) with its singularities at 
the distinct points Xi, X\, • • • may be written in the form 


E 0 (x)\\mU m (x)U^ l (x) ... Ui(x) 

mmm 


where Eo(x) is a matrix of entire functions of determinant nowhere zero and where 
Ui (x), Ut(x), ••• are extended elementary matrices of the form (7) such that 
the finite singular point of U\{x), U t (x), •••, Uk(x) lies at X \, that of 
Uk+i(x), • • • Ui(x) lies at and so on. 

Proof. The possibility of selecting extended elementary matrices U i(x), 
•••,Uk(x) with singular point at x\ , such that the product U k ( x ) • • • U i(x) 
is equivalent to E(x) at x = x lf is immediately obvious. In fact we may 
determine the product by successive application of the extended form of the 
lemma of § 1, so that it is equivalent to E(x) at x\. Consider now the 
product 

E(x)lU k (x) ... Ui(x)r l , 


which is a matrix of entire functions with its finite singular points at x it 
x it .... Treating this matrix at x 2 as we did E ( x ) at x 2 , we can find a second 
product of the form U k +i(x) •• • Ui(x) to which the product matrix under 
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consideration is equivalent at x 2 • Proceeding in this way we obtain a sequence 
of matrices Ui(x), U 2 (x), 

We can, in addition, secure the uniform convergence of the elements of 
the product matrix 

U m (x)U^t(x) ••• l\(x), 


for lim m = » , within any arbitrary circle |ar| = p in the complex plane. 

For we have noted that, at each step in the determination of the successive 
factors, the constants c Xt c it • • •, c* are chosen independently of the integer 
r. Thus, at each step after these constants have been chosen, we can take r 
so large that the elements of the /th column of that factor, 






differ from the corresponding elements of the set 


0 , 0 , •••, 1 , ... 0 

within any assigned circle |x| - p <\x k \ by an arbitrarily small amount. 
Hence the elements of the corresponding factor U m (x) can be made to differ 
from the elements of the unit matrix by arbitrarily small quantities throughout 
this circle. 

It is therefore apparent that the factors U i (*), U 2 (x), • • • can be so taken 
that the limit of their product exists uniformly within every fixed circle with 
center at the origin in the complex plane. The elements of this limit matrix 
Ei(x) are of course entire functions, and it has the desired equivalence proper¬ 
ties at xi, x 2 , • • •. The matrix E 0 (x) defined by the equation 

E(x) = E 9 (x)E l (x) 

is a matrix not singular in the finite plane. 


5. Equivalence problem for matrices of entire functions 

The results of § 3 permit us to solve the following problem: 

Equivalence problem. Given polynomial matrices V x (x) t V 2 (x), ••• 
each with its only finite singular point at the distinct points x Xl x 2 , ••• re¬ 
spectively, to construct a matrix E(x) of entire functions equivalent to V x (x), 
K 2 (a:), ••• at xi, x 2 , • • • respectively but with no other finite singular points. 

In fact, let us use the bracket notation [T(x)] e as before to denote some 
polynomial matrix equivalent to T (x) at x = £ but otherwise not singular in 
the finite plane. Let us write 
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V^x) = [V l (x)U = V l (x) t 
v 2 (x) = lVt(x)Vr'(x)u, 

V 3 (x) = [V t {x )VT'{x)V;'{x)U, 


Consider now the matrix of entire functions 

E(x) - \JrnQ m (x)V m (x) ... V l (z), 

where the matrices Q m (ar) are chosen as in Theorem II so that the limit is 
approached uniformly. It is readily seen that the product V m (x) • • • Vi (x) 
is equivalent to K,(x) at z — x, for t = 1,2, • • •, m. This is obvious for 
i * 1. Furthermore at x = x 2 we may write 

V 2 (z)-A(x)V t {x)V7 x (x) 

where A ( x)_ is not singular at x - x 2 . It follows at once that V 2 (x) V\ ( x ) 
and hence V m (x) ••• V x {x) is equivalent to V 2 {x) at x = x 2 . Continuing 
in this way we may establish the truth of the statement made for i — 1,2, 
•• •, m. 

The definition of E(x) now shows that it has the stated properties. Thus 
the equivalence problem has been solved. 

The most general solution of the equivalence problem w E 0 {x) E{x) where 
E 0 (x) is an arbitrary matrix of entire functions of determinant not zero in the 
finite plane. 

6. Matrices with a finite number of essential singularities 

Thus far we have restricted attention to matrices with singular points of 
analytic type. We propose now to consider the question of the factorization 
of matrices with isolated essential singularities. We will restrict attention at 
first to the case in which there are only a finite number of isolated essentially 
singular points. 

I shall assume the truth of the following lemma: 

Lemma. Let L(x) be a matrix of functions for which x = ® is an isolated 
essentially singular point. Then it is possible to write L{x) in the product form 
A ( x) E ( x) ( x k * ha ) where A(x)isa matrix of functions not singular at x = « , 
where E(x) is a matrix of entire functions not singular in the finite plane, and 
where k\, k 2 , • • •, k n are integers. 

For the proof of this lemma the reader is referred to the Mathematische 
Annalen, vol. 74 (1913), pp. 122-133. With its aid we may at once prove 
the theorem which follows: 

Theorem IV. Any matrix F ( x) with a finite number of isolated singularities 

Tram. Am. Math. Soc. 26 
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(essential or non-essential) at the distinct points X\,X\, • • •, x*,, x^+x = ® , and 
no others, can be written as a product of factors E(x) W m (x) fV^i (x) ••• 
W\(x) where these factors W are 

W,(x) = 

7/ere E(y), E m (y ), • • •, £» (y) are matrices of entire functions not singular 
in the finite plane and Up (l = 1,2, ••• ,m;j = 1,2, • ••, n) are integers. 

Proof. It is clear that F[(xi + 1/|)] has an isolated essential singularity 
at $ - oo and so, by the above lemma, may be written as 

A, ({)£.({) ({*/’««), 

where E\ (() is a matrix of entire functions which is not singular in the finite 
plane. Transferring back to the variable x, we obtain the decomposition 

where F\ (x) is not singular at*»i|. It follows that F\(x) has singulari¬ 
ties only at xj, • • •, x m , and aw+x = ® . 

By the same process we find 

where Ft(x) is not singular save at x a , • • •, and Zm+i «= ® . 

Proceeding successively in this manner we obtain a set of similar equations 
by introducing matrices Ft, • • •, F m . The last equation is 

i(x) - F m {x) - a.)-*}"*,,), 

where F m (x) is not singular save at infinity, and hence is a matrix E(x) of 
entire functions of the desired type. 

From this succession of equations we get the stated decomposition at once. 
It is not difficult to infer that the above decomposition is essentially unique. 

7. The corresponding equivalence problem 

We can now pass at once to the solution of the corresponding problem: 
Equivalence problem.* Given matrices Gi(x), •••,(?„( x) each with one 
isolated singularity at the distinct points X\, • • • , x m respectively in the finite plane 
and with no other singular points; to construct a matrix F(x) equivalent to 

•Cf. Proceedings of the American Academy of Arts and Sci¬ 
ences, vol. 49 (1913), pp. 521-568; in particular pp. 540-541. 
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matrices Gi(x), G m (x) at x x , •••, x* respectively, and otherwise not 
singular save for a possible pole at infinity. 

Proof. We follow the method used in §§ 2, 5 in solving the similar problem 
for matrices of polynomials and entire functions. 

Suppose that T(x) has an isolated singularity at x = £. Let [ T (x) ]* be 
used to designate the matrix 





to which, in accordance with the lemma of the preceding section, F(x) is 
equivalent at x = £. This matrix has no other singularity save for a possible 
pole at x — oo . Here E(y) is an entire function of y and not singular in the 
finite plane. 

If now we write 






we infer at once that the matrix F (x) defined by 

F(x)^H m (x)---H l (x) 

has the stated properties (see § 2). 

The most general solution of the equivalence problem is Q(x)F(x), where 
Q(x) is any polynomial matrix of constant determinant. 


8. The matrix with general singularities 

The factorization theorems of §§ 1, 3, admit of a further extension as follows: 
Theorem V. Let x x , x 2 , • • • be an isolated set of points in the finite complex 
pane with a set of limit points [c), one of which lies at infinity in the plane. 
Further, let E x (y), E 2 (y), ••• be arbitrary matrices of functions of the form 
E(y)(y k ' 5,7) where E(y) is a matrix of entire functions not singular in the 
finite plane, and k\, • • •, k n are integers. Then there exists a matrix G{x) of 
analytic functions given by a formula 

where Q m (x) is not singular in the finite plane, such that G (x) is equivalent to 
Em[\/(x — Xi)) • • • Ei [ l/(x — x 2 ) ] at x m (m = 1,2, • • •) and is otherwise 
not singular save at the points of [c]. 
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Proof. We shall follow closely the method used in proving the analogous 
Theorem II (§ 3). 

For convergence of a formula for G(z) of the type indicated, it is sufficient 
to establish that (£*+ 1 ( 2 ) can be so chosen that, in a region outside of an 
arbitrary small distance of j\*+i and within a circle of arbitrarily large radius 
with center at this point in the complex plane, the elements of 

(9) <?-+,(*)£*. ( 3 -^) 

can be made to differ in absolute value from the corresponding elements of the 
unit matrix by less than an arbitrarily assigned positive quantity. 

But this is readily done as follows. The matrix 

is obviously not singular except at x^ x and at inffnity. Hence by the lemma 
of § 3 we can 6 nd a matrix Qw+iix), not singular in the finite plane, whose 
elements differ from those of the above matrix by less than an arbitrarily 
assigned quantity for 

rS\x- *.+,| £ R, 

r, R being arbitrary positive quantities. 

Consequently we may make the elements of the matrix (9) differ from those 
of the unit matrix by less than an arbitrary positive quantity in this region 
for r as small and R as large as desired. 

It is therefore clear that Qi(x), Q t (x), • • • may be taken in succession so 
that the limit in the theorem exists uniformly so long as x lies within any circle 
of arbitrarily large radius with center at the origin and not within an arbitrarily 
small distance of any point of [c]. 

The matrix G(x) thus defined has clearly the desired property. On the 
basis of this theorem we are led at once (as in § 5) to a solution of the problem: 

Equivalence problem. Given an infinite number of points x x ,x 2 , • • • in the 
finite plane with a set of limit points [c], one of which lies at infinity, and matrices 
Ei (y), E 2 (y), • • • of the form E(y) ( y k> 5,y ) where E(y) is a matrix of entire 
functions, not singular in the finite plane, and k x , • • • , k n are integers; to con¬ 
struct a matrix G (x) equivalent to E x [l/(x - x x )], E 2 [ 1/(x - x 2 )], ••• at 
x x , x 2 , • • • respectively, and otherwise not singular save at the points of [c ]. 

The most general solution of this equivalence problem is M(x)G(x) where 
M (x) is an arbitrary matrix of analytic functions whose singularities all fall at 
the points of (cj. 
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9. An application 

I shall give now a simple application of the preceding theorems. This 
application may be stated as follows: Let e(x ) be an entire function and let 
e 0 (x), «i(*)» «*(*)* e *( x )> denote the functions x, e(x), e[e(x)), 
e{e[e(.x)]) t ••• respectively. Furthermore let F (x) denote an arbitrary 
matrix of single-valued analytic functions. A matrix G ( x) of single-valued 
analytic functions can then be found whose singular points fall at those of F[e k (x)] 
(it = 0 , 1,2, • • •) and at the limit points [c] of this set, and such that G (x) 
is equivalent to G [e (x)] F (x) save at these limit points. 

Let us write down for reference the symbolic infinite product to the left 

••• F \ F (*i(x)] F Uo(x)]» 

The singular points of any factor F [e„(x)l not belonging to [c] are isolated 
singular points. 

Consider now any such singular point £ not a point of [c). It is possible to 
take a positive integer k so large that all the factors before the kth in the 
above product are not singular at £ nor in its neighborhood, since £ is not a 
point of (c]. 

We shall demand of a matrix G (x) that it be equivalent to the last k factors 
of the product 

A [**_i(x)l ••• A Mx)] 

at every point x. 

It is clear that if k be increased G (x) will still have the same property since 
the factors introduced are not singular at £, and appear on the left of the 
previous factors. 

The set of such points £ is clearly numerable. In fact any set of isolated 
points is numerable. 

According to § 8 a matrix G (x) may be found which satisfies these relations 
of equivalence at the singular points £ of F (e m (x)) (m = 0,1, •••) and which 
has no further singular points save the points of (cj. 

This matrix G(x) will have the desired property that G (x) is equivalent 
to(?(e(x)]F(x) save at the points of [c 1 . 

In proving this statement it is first to be observed that if x is a singular 
point of F[e m (x)], and if a value y exists such that x = e(y), then y is a 
singular point of F (e«+i (y) ] . For we have identically 

/[^(x)]-F[iUfi(y)] f 

where x = e{y) . Hence, if the matrix on the right is not singular at y , the 
elements of the equal matrix on the left are analytic in x at x, since they can 
not be multiple-valued functions of x. Moreover, since as x approaches x, 
the variable y approaches y, the determinant of the right-hand side will not 
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approach zero in this case. Thus /'[^(x)] would not be singular at £, 
contrary to hypothesis. 

The converse statement is obviously true, i. e., if F [e^x^y)) is singular 
at y and if £ = e(y), then F [e*(x)l is singular at £. 

Thus we derive the precise totality of singular points of successive factors 
of the symbolic infinite product from the singularities fo of the first factor 
F(x) by forming e_i(£ 0 ), «-*(*<>), ••• where e_i(x), e_*(x), ••• denote 
the inverse functions to cj (x), e* (x), • • • respectively, which are of course in 
general multiple valued. 

Consider now any point not belonging to [c]. 

If this point £ is not a singular point of some matrix F[e«(x)l then Q(x) 
is certainly not singular at $, as has been stated. Moreover «({) is not a 
singular point for (?(x), inasmuch as in the contrary case some /’(e m (x)] 
is singular at e(£) and thus the corresponding F [e*+ i(x)) is singular at £. 
Consequently 0 [ e (x) ] is not singular at £. It follows that G (x) is equivalent 
to G [e(x)]F (x) at any such point £. 

It remains to consider a point £ for which some F [e m (x)] is singular. At 
such a point G(x) has been chosen to be equivalent to a product 

Fl*k(z)]Ple h - l (z)]--. Fle 0 (x)) f 

where k has been taken so large as to include all the factors singular at £. 
The point e (£) will be a singular point for F [ e^x (x) ] unless m is zero. Put¬ 
ting aside this contingency for the moment, we see that G (x) is equivalent to 

/ , [^ 1 (x)]F[f*_ l (x))...F[f 0 (x)] 

at e (£). Consequently G [ e (x) J is equivalent at £ to 

fl‘k(x)]Fte k - l (z)]--'Fle l (x)]. 

Hence G(x) is equivalent to G [e(x)]F(x) at such a point f. 

If, on the other hand, m is zero then the integer k may be taken zero. In 
this case G(x) is equivalent to F(x) at £, while G(x) is not singular at 
e(£). In this case also G(x) is equivalent to G [e(x)J F(x) at £. 

The italicized statement made is now proved. 

The usual existence theorems for linear difference and ^-difference equations 
show that ife(x) = x + l orgx respectively, and if F(x) is a polynomial 
matrix we may choose G(x) equal to G [e(x)J F(x). 

Harvard University 
March, 1910 
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THE BOUNDARY PROBLEMS AND DEVELOPMENTS 
ASSOCIATED WITH A SYSTEM OF ORDINARY 
LINEAR DIFFERENTIAL EQUATIONS OF 
THE FIRST ORDER 

By George D. Birkhoft and Rudolph E. Langer.i 
Introduction. 

It is the purpose of this paper to develop in outline the theory of a 
system of n ordinary linear differential equations of the first order 
containing a parameter and subject to certain boundary conditions. 
Toward this end the notation of matrices is used. For the convenience 
of the reader the paper opens with a brief review of the fundamentals 
of matrix algebra and the integration and differentiation of matrices. 
This is followed by an expository discussion of the homogeneous and 
non-homogeneous differential matrix equations of the first order. 
The major portion of the treatment is devoted, however, to the homo¬ 
geneous differential vector equation with a complex parameter in its 
coefficient, and to the system composed of such an equation and 
suitable boundary conditions. The solutions of the equation for large 
values of the parameter are discussed and the formal development of 
a vector of arbitrary functions into a series of solutions of the system 
is obtained. The paper closes with the proof of the convergence of 
this development under appropriate conditions, which, in the ordinary 
notation, establishes the possibility of simultaneously expanding n 
arbitrary functions in terras of the characteristic solutions of a prop¬ 
erly restricted differential system of the type 

Vi( x ) = 2 |a<*(x)X + £<*(*) |!/*(*), 

n 

2 fo.*y*(a) -f 0.*y*(6)) =0, i = 1 , 2,.. n. 2 

4-1 

When reduced to a single equation of the nth order this includes as a 
special case the expansions obtained by Birkhoff in 1908 

1 Much of the material preceding the proof of convergence is due to Birk¬ 
hoff, having been developed by him in lectures at Harvard University in the 
fall of 1920. The reorganization of this material into its present form, the 
treatment of the irregular case, and the proof of convergence are due to Langer. 

a For other developments in this field and more complete references see the 
following papers in the Transactions of the American Mathematical Society: 

Birkhoff, On the Asymptotic Character of the Solutions of Certain Linear 
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Section I. 

Definitions . 3 

An array of elements of the form 

( on an on ... • Olf* 


o*i flw . 

oai .Oj n 


a«. a nn 


in which the number of rows equals the number of columns is called a 
square matrix, and is denoted either by (a if ) or by A. Two such 
matrices are said to be equal when, and only when, every element of 
the one is equal to the correspondingly situated element of the other. 

The sum of two matrices, (a,,) and (&„•), having the same number of 
rows and columns, is by definition the matrix (a*/ + 6„), from which 
it follows that matrix addition is both commutative and associative, 
i.e., A + B = B + A, and A + (B + C) = (A + B) + C. 

The product of two n rowed matrices A and B is defined by the 
identical equation 


(an) (M 



on the basis of which it is easily verified that matrix multiplication is 
both associative and distributive, i.e. 


A(BC) = ( AB)C , 
A(B + C) = AB + AC. 


Differential Equations Containing a Parameter, and Boundary Value and Ex¬ 
pansion Problems of Ordinary Linear Differential Equations, vol. 9 (1908), p. 
219 and p. 373. 

Wilder, Expansion Problems of Ordinary Linear Differential Equations with 
Auxiliary Conditions at More than Two Points, vol. 18 (1917), p. 415. 

Hopkins, Some Comergent Developments Associated with Irregular Boundary 
Conditions, vol. 20 (1919), p. 245. 

Hurwitz, An Expansion Theorem for a System of Linear Differential Equa¬ 
tions of the First Order (about to appear in vol. 22 (1921)). 

Langer, Developments Associated with a Boundary Problem not Linear in the 
Parameter (about to appear in vol. 22 (1921)). 

3 For a more ample discussion of the theory of matrices see Bftcher, M., In¬ 
troduction to Higher Algebra. New York; The Macmillan Co., 1907. 
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That it is in general not commutative is a consequence of the fact that 

S aikbkj is in general not equal to 2 b ik a ki . The rearrangement of 

the factors in a matrix product is, therefore, as a rule, not permissible. 

The multiplication of a scalar into a matrix has simply the effect of 
multiplying each element of the matrix by the scalar. Thus if A: is a 
scalar, then kA = Ak = (lean). Conversely, any factor common to 
all the elements of a matrix can be factored from the matrix. 

Two special matrices must be mentioned, namely 0 = (0), the 
zero matrix, and / = ($,/), the unit matrix, where 5 ,7 = 0 when t 
= 1 . These matrices satisfy respectively the relations 

AO = OA = 0 
and AI = IA = A. 

The determinant formed from the elements of a matrix without 
changing the order of the array is called the determinant of the matrix. 
The alternative notations | a {i | or | A | for the determinant of the 
matrix (an) will be used. 

Given a matrix A it follows that if | A | 4= 0 then there exists a 
unique solution in the z’s for each of the linear systems 

n 

2 Oik Xki 0 — * = 2, .. .n, 

*-1 

where jo ranges from 1 to n. This means that there exists a unique 
set of n 2 quantities *</ such that 

n 

2 Qik Xk / = $»7» *» i» = 1, 2,...n, 

*-i 

i.e. there exists a unique matrix (*</) such that 

A-(xn) = I. 

This matrix (z<y) is denoted by the symbol A~ } and is called the inverse 
of A. From its derivation it is seen to satisfy the relation AA~'= /. 

Either of the relations, AX = 0 or XA = 0, leads, on the assump¬ 
tion that | A | 4= 0, to the conclusion X = 0, as is evident from the 
theory of the systems of linear equations to which the matrix equations 
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are equivalent. It follows from the relations A A- 1 = I and IA = AI, 
however, that AA~ l A = IA = AI, i.e., A(A~ l A — /) = 0. 

Hence if A is any matrix for which | A | 4= 0, it is seen that 

A-'A -1=0, i.e., A-'A = I = AA-*. 

In accordance with the following definitions, namely 

JA(x) dx = a {i (x) dx'j 

a a 

dA(x) _ / 

dx \ dx r 


and 


a matrix is seen to be integrable or differentiable if and only if this is 
true of each of its elements. It is also dear that if C is a matrix of 
dC 

constants, then -- = 0, while for any product 
dx 

d A n dB dA n 

dx dx dx 


Section II. 

The equation Y’(x) = A(x) V(x). 4 

Consider any matrix of functions K(x) which satisfies (i.e. is a 
solution of) the equation 

(1) Y'= AY. 

In accordance with the rules for determinants we have 



y'n ....yin 


Vn ... yi- 


yn.yin 


d | Y | 

yn ....yin 


y'n ... y5n 


yn .y*n 


<■ 


-f 


+...+ 



dx 





» 


ym ....y„n 


ym ....ynn 


y n i ... .y«n 



4 For the general theory of matrix differential equations see Schlesinger, L. 
Vorlesungen uber lineare Differeniialgleichur^en. Leipzig; B. G. Teubner] 
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whence, substituting from the system 


y*i = 2 Oikyti, i,j, = 1 , 2 . .n 

*-i 

(which is equivalent to equation (1)) we obtain 


d\Y 

dx 


2 auy*,.. .2a lk ykn\ 


y2n 


k 

yz\ 


ym 


ynn 


+ 


yi i . y i» 

2 a 2k y kl .. .2 a 2 *y*„ 


ym 


ynn 


+ ...+ 


yn 

yn 


yin 

yin 


2 a nk y k i. y*. 


i.e. -|y|-a I 1 |yH-a„|y|+...+a„|F|. 

Let us suppose, now (t), that the elements of A(*) t Y(x), and Y'(x) 
are each continuous in an interval a g x g 6, and (it), that at some 
point x of this interval | Y | 4= 0. Then throughout a neighborhood 
of the point in question 

d | Y I 

TTTT " 2 a ** dx > 

y *-i 


or, integrating, 


y| = ce 


X 2 a “ dz 


Inasmuch as the right-hand side of this equation cannot vanish if 
c 4= 0, it is seen that the hypothesis that | Y | 4= 0 for some x leads to 
the conclusion that | Y | differs from zero for all x. Thus we infer the 
Theorem : If Y ( x ) is a matrix of functions which satisfies equation 
(1), while | Y(x 0 ) | = 0, a g x 0 g 6, then | Y(x) | a0, agxgb. 

A matrix of functions Y(x) of this type which satisfies equation (1) 
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and whose determinant | Y(x) | ^ 0, is called a matrix solution of 
equation (1). We assume at least one such solution to exist. 

Theorem: If Y(x) is any (matrix) solution of equation (1), and C 
is any matrix of constants (for which | C | :£ 0), then y(x), defined by 
Y(x ) = Y (x)C, is also a (matrix) solution of equation (1). 


Proof: We have 


dY 

dx 



But by hypothesis 

Hence 


s— 

‘2-Arc, 

dx 


i.e. 




Moreover, 3ince the product of two matrices is derived in the same 
manner as that of two determinants, it follows that | Y | = | Y \ \ C |. 
Hence | Y | * 0, if | C | 4= 0 and | Y | * 0. Q.E.D. 

Theorem: Given any matrix of constants Y 0 for which | Yq | 4= 0, 
then there exists a matrix solution of equation (1) which for any 
preassigned x, say x = io> reduces to the matrix Po- 

Proof: It has already been shown that when y(x) is a matrix solu¬ 
tion of equation (1), then y(ar)C is also such a solution, where C may 
be any matrix of constants whose determinant is not zero. Then in 
particular C may be chosen as the matrix y _1 (*o) y 0 , whereupon it 
follows that y(x) y-*(x 0 ) Y 0 is also a matrix solution of equation (1). 
This solution, however, obviously reduces to the matrix Y o when 
x = xq. Q. E. D. 

Theorem: If Y(x) is a matrix solution of equation (1) then Y(x)C 
is the most general solution of equation (1). 
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Proof: Let_y(or) beany solution of equation (1) whatever. Then 
the relation y(x) = y(x)4»(x) defines a matrix <f>(x), and substitut¬ 
ing from this expression in equation (1) we have 


i.e. 

But by hypothesis 


Hence 


i Y * 


AY*, 


d* dY 

Y— + —* = AY*, 
dx dx 


Y^+AY* = AY*, 
dx 


i.e. 



0 . 


Inasmuch as | Y(x) | 4= 0, it follows that 


d* 

dx 


= 0 , and *(x) = C. 

Q. E. D. 


A mere interchange of the rdles played by the row^and the columns 
of the matrices involved transforms the discussion carried out thus far 
for equation (1) to the corresponding discussion for the equation 
r= YA. The facts in the two cases may, therefore, be established 
by precisely the same methods. 

The pjiir of related equations 


( 1 ) r « AY, 

(2) Z' = - ZA, 


are said to be adjoint , either being the adjoint of the other. 

Theorem: If K(x) is any matrix solution of equation (1), and Z(xj 
is any matrix solution of equation (2), then 

Z(x) Y(x) m C. 

Proof: From the relation 



dZ 

T Y + Z 

dx 


dY 

dx 
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it follows, upon substituting from equations (1) and (2), that 

tZY= {-ZA) Y + Z{AY). 

dx 


Hence ^ ZY = 0, and Z(x) Y(x) =C. Q. E. D. 
dx 


Converse Theorem: If Y(x) is any matrix solution of equation 
(1), and the matrix Z(x) is defined by the relation Z(x) Y(x) a C, 
where | C | ^ 0, then Z(x) is a matrix solution of equation (2). 

Proof: We have ^-ZY — 0, 

dx 


.e. 


f* + z T = 0 - 


In virtue of equation (1), therefore, 


- 7 - Y + ZA Y - 0, 
dx 


and it follows, since | Y | 4= 0, that 


dZ 

dx 


-ZA . 


Moreover, | Z | = \Y’ 1 \ • | C | * 0. 


Q. E. D. 


Any pair of solutions y(ar) and Z(x) of equations (1) and (2) re¬ 
spectively which satisfy the relation Z(x) Y(x) 3 1, are said to be 
associated solutions. Thus if Y(x) is any matrix solution of equation 
(1) the associated matrix solution of equation (2) is Z(x) = Y~ l (x). 

A differential equation is said to be self-adjoint when and only when 
it is identical with its adjoint after interchange of rows and columns. 
A necessary and sufficient condition that the equation (1) be self- 
adjoint is readily seen to be that a,-,- = — ay< for all i and j. 
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Section III. 

The equation Y'(x) = A(x)Y(x)+ B(x). 

The Existence Theorem: Given the equation 

(3) Y'=AY + B, 

where A(x) and B(x) are matrices of continuous functions, a ^ x ^ b, 
then there exists a unique matrix Y(x ) whose elements are functions 
which are continuous together with their first derivatives, a ^ x ^ b, 
and which satisfies equation (3) as well as the condition y(:ro) = Yo, 
the matrix Y 0 being any prescribed matrix of constants. 

Proof: By means of the relation 

x 

y„Cr)=y, + f U(t) r m -,(t) + b( t) I dt 

ft 

it is possible to define the following infinite sequence of matrices 
Yi(x), Y 2 (x),.. . Y m (x), .. . which satisfy the relations 

Y’,(x) = A(x) Yo + B(x), Y,(x 0 ) = Y„ 

Yi(x) = A(x) Yi(x) + B(x), y,(i„) - Yo, 

YJ(x) = A(x) + B(x), y.(*o) - Yo. 


Then setting y„(x) — y„_i(x) = U m (x) 

in the identical equation 

Y m (x) - y»+ (y.(i)- y.)+(y,(x)- y,(*))+... +(y - (*)-y m _ 1 (*)) 

we have 

y n (x) = Yo+ u,(x) + Ut(x) +...+ U m (x). 

Moreover, U„ (x 0 ) = 0, 

while UJ{x) = YJ(x) - y..,'(,) 

= A(x) fy m .,(x) - y | 

= A(x) J7._,(x). 
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Denoting by the symbol «the fact that each element of the matrix 
of constants on the right represents a value as large as the largest 
numerical maximum attained by any element of the matrix on the left, 
we have, since A(x ) and U i(x) are continuous in the closed interval 
x £ b, 

U x (x) «(*), 

A (x) «(«) . 


From this it follows that 

U' t (x) = A(x) U, (x) = Qa„ t$) «(nak) 
and integrating 

lh(x) « ( kna | * — x 0 | ). 

Likewise U’, = AU t = (lau «( 1 : a• kna | x - *o | j , 

U,(x)«(biW l *"* 01 ’ ), 

and similarly for / - 3, 4, , 

However the infinite series of matrices 


converges to the matrix 


Hence the series 


/ ® 1 anb—a\ l \ 
\W) l / / 


k -1). 


03 


Yo + 2 


j-i 


converges uniformly throughout the interval a ^ x ^ b, and since its 
terms are matrices of continuous functions the matrix Y{x) which it 
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represents is necessarily one with elements continuous for a ^ x 6. 
Now 

X 

K.(z) = r.+ f \ a ( i ) y «_.(<) + B(0) a 

a 

by definition, and since the convergence of Y m (x ) to Y(x) is uniform 

X 

lim y.(x) = Ko+ f Hm { A(t) Y„.,(() + fi«)) dt, 

m- oo u m-co 

s» 

X 

namely, K(x)=y 0 + f lA(t)Y(t) + B(t)] dt. 

X» 

Differentiating we see that the elements of Y\x) are continuous, 
a x ^ 6, while K'(x) = -4(x) T(x) + B(x). Hence the solution 
exists as stated. 

To prove the solution unique suppose that both Y(x) and Y(x) are 
solutions of equation (3) each reducing to the matrix Y 0 for x =» x 0 , i.e. 

Y'M - AM Y(x) + B(x\ Y(x 0 ) = Y 0 , 

Y'(x) - -4(x) T(x) + £(*), Y(x ,) = y 0 . 

Now K(x) — K(x) s Z)(x) is a matrix whose elements are continuous, 
a ^ x £ 6, and which satisfies the relations 

D f (x) = A(x) D(x), DM) - 0. 

Let any neighborhood of the point x 0 be chosen, say | x — x 0 | g 6. 
Then, if d denotes the largest numerical maximum of any element of 
D(x) for any x of this neighborhood so that D(x) « (d), 

A(x) D(x) « (nad), 

and D'(x) «(nad). 

Hence D(x) « (nad | x — x 0 | ) « (nadS). 

But for some x of the interval at least one element of D(x) is numeri¬ 
cally equal to d (by the definition of d), and for this element it follows 
that d ^ nadS, i.e. d { 1 — 7ia$) ^ 0. 
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However, since a and n are fixed numbers and 6 can be chosen so that 

$< — this leads to a contradiction unless d = 0. It follows that 
not 

there always exists a finite interval throughout which d = 0 and hence 
throughout which D(x) = 0. Inasmuch as this implies that D(x) = 0, 
a ^ x ^ 6, the solutions Y(x) and y(x) must be identically the same. 
Q. E. D. 

The solution of equation (3) can be easily expressed in terms of the 
solutions Y k (x) and Z k (x) of the homogeneous equations (1) and (2) 
respectively. Thus, multiplying both sides of equation (3) on the 
left.by Z k (x) we have 

Z k ^ = Z k A Y + Z„B, 
dx 

which, in view of equation (2), can be written 

Z 'T = " IT Y + Z ” B ' 

dx dx 

i.e. 

j- Z„Y = Z h B. 
dx 

Integrating we obtain 

* 

Z k Y = C + J Z k (t) B(() dt, 

a 

while the multiplication of this equation on the left by Y^ (x), the 
solution associated with Z^(x), yields 

* 

(4) y(x) = Y k (x) C + f Y k (z) Z„(t) B(t) dt. 

a 

This is the general solution of equation (3). In consequence any 
particular solution Y(x) may be written 

z 

Y(x) = Y h {x) C+J Y k (x) Z h (t) B(t) dt. 
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Subtracting this from equation (4), however, we obtain 

Y(x) = Y(x) + Y k (x) I C-C), 

from which it is seen that if Y(x) is any particular solution of equation 
(3) the general solution is given by 

(5) Y(x) = Y(x) + Y h (x) C. 

If in particular C is taken as C = 0 in formula (4), the solution which 
is characterized by the fact that Y(a) = 0 is obtained. This is called 
the principal solution at x = a. Since the choice of a as a limit of 
integration is unrestricted, the expression for the principal solution at 
any chosen point is at hand. 


Section IV. 

The differential system 

V <*>• = Mx)Y(x). + B(x ). 

W 0 Y(a). + W h Y(b). = 0. 

A matrix in which any row (or column) is precisely like every other 
row (or column) is called a vector. That a particular matrix is a vector 
is indicated by means of a dot suitably placed in relation to the letter 
designating the matrix in question, the dot preceding in case it is a 
vector in which the rows are the same and succeeding in case it is a 
vector of identical columns. 


Thus 


and 


•A = (a,-) 


A • = (a.) 


ai a* •.a« 
at a 2 ...a. 


a i a 2 ...a n 
a\ aj...ai 

a 2 a 2 ... a 2 


a n a„ . .a n 
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That -AB = • C , and that AB • = C-, are facts readily established by 
direct reference to the rules for multiplication. A matrix (a) all of 
whose elements are the same is written -A-. From the preceding 
statement it is clear that AB- = -C-. 

Given W a and W b , any two constant matrices for which | W a | 3= 0 
and | W b | 4: 0, then the two corresponding homogeneous vector 
differential systems 

_ ( n*)- = AM Y(x)- 

(6) + W b Y(b)- - 0 


and 

(7) 


( -Z'(z) = - -Z(x) A (x) 
l ’Z(a) HV+ -2(6) W b ~'= 0 


are said to be adjoint. 

Theorem: The number of linearly independent solutions of system 
(6) is always equal to the number of linearly independent solutions of 
system (7). 

Proof: It has been shown that if Y(x) is any matrix solution of the 
differential equation (1), the most general solution is }'(*) =Y(x)C. 
From this, C = Y' l M I'M. and if the solution Y(x) is a vector 
y(a-)- then C will be a vector C-. 

The general solution of the differential equation in system (6) is, 
therefore, 

Y(x)- = Y(x) C-, 

and since the substitution of this in the boundary conditions gives 

(8) W a Y(a) C- + W b 7(6) C- = 0, 

Y(x)- is seen to be a solution of system (6) when and only when C- 
is a solution of the equation (8). Moreover, it is readily seen that a 
necessary and sufficient condition that a set of solutions of system (6) 
be linearly independent is that the corresponding solutions of equation 

(8) be independent. Setting W a K(a) + N & Y(b) — (po), equation 
<8) can be written in the form 

n 

(9) 2 pik Ck = 0, i = 1, 2,.. .n. 

*-i 
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The number of linearly independent solutions of the linear algebraic 
system (9), (and hence of equation (8)) is, however, precisely equal to 
the difference between the number of equations, n, and the rank of the 
determinant | p</1. Thus when | \V a Y(a) + W b Y(b) | = 0, and is 
of rank (n — k), then there are precisely k linearly independent solu¬ 
tions Ci*, C 2 *,...C**, of equation (8), and correspondingly just k 
linearly independent solutions Yi(z)- = Y(x) C,* t= 1,2 t .. .k, 
of system (6). Conversely if there are k independent solutions of 
system (6) the rank of the determinant is (n — k). 

Suppose then that system (6) is known to have just k linearly inde¬ 
pendent solutions. This means that the determinant vanishes and is 
of rank ( n—k ), and since | X\U X* | is of the same rank as | U | when 
| X\ | 4= 0, and | Xt | 4= 0, 6 it follows that 


| Z(a) W a -' { W a Y(a) + W b Y{b) | Z{b) JV b -' | = 0 


and is of rank (n — k), Z(x) being any matrix solution of equation 
( 2 ). If in particular Z{x) is chosen as the solution associated with 
Y{x) this reduces to the statement that | Z(a) W a ~'+ Z(b) W b ~ l | = 0, 
and is of rank (n — k), which implies that system (7) also has just 
k linearly independent solutions. Thus the theorem is proved. 

A system of the type ( 6 ) or (7) is said to be either compatible or in¬ 
compatible according as it does or does not admit of a solution not 
identically 0. Its compatibility is said to be Ar-fold when the number 
of its linearly independent solutions is k. 

Consider the non-homogeneous system 


I Y'(x)- = A(x) Y(x)- + B(x )• 
I W a T(a). -f- W b T( 6 ). = 0. 


Theorem: A necessary and sufficient condition that system (10) 
has a unique solution is that the corresponding homogeneous system 
( 6 ) is incompatible. 

Proof: It was shown (formula (5)) that the general solution of equa¬ 
tion (3) is 

Y{x) ^ Y h (x) C -f Y(x) 


5 Cf. B 6 cher, loc. cit., pp. 77-79. 


361 



66 


BIRKHOFF AND LANCER. 


where Y(x) is any particular solution and Y b (x) is a matrix solution 
of the homogeneous equation (1). Hence the general solution of the 
differential equation in (10) is 

Y(z)- m Y h {x)C- +?!)• 


The substitution of Y{x)- in the boundary conditions shows it to be 
also a solution of the system (10) provided only that C- satisfies 

W a y(o) + W„Y(b) + | W a Y h (a) +W b Y h (b)\ C- = 0. 

But this relation can be solved for C-, and uniquely determines C- 
when and only when | W a K*(a) -f- M'* Y/,(b) | 4= 0, that is when 
system (6) is incompatible. Q. E. D. 

Assuming then that system (6) is incompatible it is possible to 
obtain by the following procedure a solution of the equations (10) 
which is symmetrical with respect to the ends of the given interval 
a £ x ^ b. 

Let y»(x)- and K*(ar) • be any pair of solutions of the differential 
equation 

(11) K'W- - A(x) Y{x)- +**(*)•. 

Then clearly their sum Y\(x)* + Yt(x)' = y(x)* is a solution of equa¬ 
tion (10). But by formula (4), applied to equation (11), a particular 
choice of Y i(z)* and Yt(x) • is seen to be 

X 

y.to- = } / Y„(x) Z„(t) B(t)-di 

a 

X 

y»(*)- = i / y k (z)z„(t)B(t)<u, 

6 


from which it follows that 


Y(z)- = i 


j/ Y k (x)Z„{I) B(t)-dt + f Y„(x)Z h (t) 


is a particular solution of the differential equation (10). 
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Defining G(x, t) by the relations 


G(x, t) 


i Y k (x) Z h (t) 
-§ Y h (x) Z h (t) 


when t < x 
when t > x, 


we may write 


Y(x) 


t> 

-/«<*■« 


B(t )• dt. 


In accordance with formula (5), therefore, the general solution of 
the differential equation (10) is given by the equation 

6 

Y(*)- = f G(z, t)B(t)-dt+ Y. h (x)C■ . 


The substitution of this form in the boundary conditions yields the 
equation 


b 

f [WoG(a,t) + W b G(b,t)\B(t)-dt+lfV.Y h (a) + W„Y k (b)}C- = 0 

a 


for the constant vector C-. Multiplying by the inverse of the matrix 

^ = 1^. y*(a) + W b f a (6)|, 

(A“* exists since system (6) is incompatible) we see that 

6 

C- - - A -'f | W.G (a, t) + W b Q (6, t )) Bit) ■ dt. 

a 

It follows that the general solution of system (10) is, in terms of a 
matrix (?(x, t ) which is defined by the formula 


(12) G(x, <) = G(x, t) - Y h {x) A- 1 [W a G(a, t ) + W b G(b, <)), 
given by the expression 


(13) 



G(x, t ) B{t)- dt. 
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The matrix of functions G(x, t ) is known as the Green’s function for 
the homogeneous system (6). By a precisely similar method the 
Green’s function //(*, t) for the related system (7) may be derived, the 
general solution of the non-homogeneous system 

I • Z'(x) = - -Z(x) A(x) + • D{x) 


1 . Z\x ) = - - Z(x) A(x) + -Di 
\ • Z(a ) W a -' 4- -Z(b) W b -'=0 


being given in terms of H(x , t) by the formula 


O 

■Z(x) = f ■ D(x) H(x, t) dt. 


Theorem: If G (x, t) and //(z, t) are the Green’s functions for sys¬ 
tems (6) and (7) respectively, then 

G(x,t)+ H(t, x)mO, t*x. 

Proof : 6 Let B • and D be any two vectors of the types indicated 
and consider the two systems 

r- = AY- +B-, W a K(a)• 4" W b Y(b)-= 0, 

. Z f - - ZA +-D, • Z(a ) W a -'+ • Z(b) W b -'= 0. 

Multiplying the differential equations respectively by -Z on the left 
and Y • on the right and adding we obtain 

-ZY-' 4- -Z’Y- = ZB- 4- -DY-, 

an equality which upon integration yields 

b b 

(16) • ZY-\ b a - f -Z(t) B(t)-dt + f -D(x) Y(x)-dx. 

a a 

Now from the boundary conditions we have 

•Z(6) - - -Z(a) W a W b , Y(b)- = - BV 1 W a Y(a)- t 


whence 


z(b) y(6) 


= -Z(a) Y(a); 
■ZY■ I 6 = 0. 


6 The proof by direct computation is not difficult though somewhat laborious. 
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b 

But -Z(t) = J* D(x) H(t, x) dx 

o 

b 

and y(x)*= y* G'(x,t)B(t)-dt 

a 

from formulas (13) and (15). 

Hence equation (16) is equivalent to 
6 6 

(17) if ■D(x) | H ( t, x) + G(x, <)) B(t)-dx dt = 0. 

a a 

This result, having been obtained without reference to the nature of 
B- and D must, moreover, hold for all possible choices of these 
vectors. We shall proceed to choose a particular set apposite to the 
proof in hand. 

Let any point (x 0 , t 0 ) of the region a g x, t g 6, x 4= t, be arbitrarily 
chosen and draw the surrounding small rectangle As whose sides are 
t — < 0 =*= A t and x = x 0 =*= A x. Then choose £• and •/> so that 
b t (0 = 0, when l 4= j 0t b io (t) 4= 0 in As, b io (t ) - 0 outside of As, 
d k (x) m 0, when k 4= t'o, d i9 (x ) 4= 0 in As, d^x) = 0 outside of As. 

For this choice equation (17) is equivalent to 

to* At xo*Az 

(18) / / *•(*) !*<«,(<. z) + g ioi ,( I, <)) b,,(t) dxdt = 0 , 

to-At x®-Ax 

and inasmuch as <*<,(*) b io (t) 4= 0 in A*, clearly {h^t, x) + *«*(*, t)\ 
changes sign, i.e. vanishes somewhere in this region. Now let Ax and 
At approach zero. Then in the limit we have 

Aitfofo, *o) + litfoGto, to) = 0. 

From this it is seen that G(x 0 , t 0 ) + H(t 0 , x 0 ) = 0, and since (ar 0 , to) 
was any point not on the diagonal it follows that 

G{x, t) + H ( t, x) = 0, i4l. Q. E. D. 

By direct reference to formula (12) it is readily seen that the Green’s 
function possesses the following characteristics: 
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i) The elements of G'(x. t) are continuous in x except for x = t. 
Along this line (x = t) there is a jump of unit magnitude in the 
elements of the principal diagonal, i.e. 

G’(x, x - 0) - G'(x, x + 0) = /. 

ii) For any given t, C(x, t) satisfies equation (6) in x, except along 
the line x = t, i.e. 

OX 

iii) For any given t, G(x, t) satisfies the boundary conditions of 
system (6) in x, i.e. 

W a G(a, t) + W b G’(6, t) - 0. 

Conversely we have the 

Theorem: The dependence of G(x, t) upon the variables x, t is com¬ 
pletely determined by the characteristics (i) to (iii) above. 

Proof: Suppose G (x, t) is a matrix possessing the characteristics (i), 
(ii), and (iii). By (i), (ii), «/(x, t) defined by «/(x, I) =G' (x, t ) — G(x, () 
is continuous for all x, a ^ x ^ b. Moreover, J(x, t) satisfies (iii) and 
is therefore a solution of system (6) in x. But system (6) is incompati¬ 
ble by hypothesis. Hence J(x, /) ■ 0 

and G(x, t) - <?(x, <)• Q- E. D. 

It is readily verified that a further set of three characteristics which 
completely determines the dependence G(x, <) upon the variables may 
be obtained by interchanging x and t and replacing system (6) by 
system (7) in the discussion above. 

The fact established by this theorem should be carefully noted. 
While the choice of the pair of associated solutions Y h and Z/, on the 
right-hand side of equation (12) is not unique, yet the entire function 
G'(x, t) is independent of that choice 
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Section V. 

The formal solutions of the equation 
V"<*)- - { A(x)\ -f* 

Returning to the homogeneous equation let us consider the nature of 
the solutions when the matrix coefficient of Y • is made to depend 
linearly upon a parameter X which is free to take on all values in the 
finite complex X plane. To this end we shall study the equation 

(19) *"(*)• - { A(x) X + B(x) ) Y(x)-, 

where A(x ) and B(x) are matrices of continuous functions, by making 
the assumption that the equation has a formal solution 

(20) Y(x)■ m e k l y{t)dl j />,(*)• + I />,(*). + I p 2 ( x ). + ... j 
where a is any chosen constant. 

If n = 1, equation (19) can be directly integrated and is seen to have 
an actual solution of the form (20). The passage to formulas (45) can, 
therefore, be made directly in this case, and hence we shall assume 
in the intervening work that n ^ 2. 

X 

Setting J’ y(z) dx = r(z), and substituting the form (20) in the 

a 

equation (19), we obtain the formal identity 

Mx)e* r< ” j Po(z) ■ + J *>,(*) • + •.. j + e xr< * ) j PUx ). + ... j 

- U(*)X + B{x) ) e xr<1> j P,(*) • + i Pl{x ). +... J, 

from which it is seen, upon equating the coefficients of X, that 

y(x) Po(x) • s A(x) P 0 (x)-. 

This is satisfied by a vector P„(z)- not aU of whose elements vanish, 
when and only when 

( 21 ) I <**>(*) “ Si, y(x) | = 0, 
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i.e. when y(x) satisfies equation (21). For any given x, say x = x 0 , 
however, the left-hand side of (21) is a polynomial of degree n in 7(*o). 
Hence the equation is satisfied by n roots 7i(xo), 72 (^ 0 ), • 7n(xo). 
We shall assume that for the case in hand these roots can be grouped 
into the n functions 7i(*)» 7*(*).-•-7»(*)» wh >ch satisfy the three 
conditions 

( (i) 7 <(x) continuous, t = 1, 2,.. .n, a ^ x ^ 6, 

(22) < (ii) 7 i(x) 4= 7»(x) for j 4= h 

( (iii) 7»(x) 4= 0, t = 1, 2,.. .n. 

Clearly the last condition can be fulfilled only if | A | 4= 0; we shall 
assume this to be the case. 

Consider now a change of the dependent variable in equation (19). 
Setting F(:r) * = 4>(x) Y(x) •, where <l>(x) is a matrix whose elements are 
continuous as well as their first derivatives, a ^ x ^ b, and | <1> | 4 1 0, 
the equation becomes 4>' F* + — MX 4- if I ♦ F*, that is 

y'. = | 4 >-i /14>X -f 4>-' B<t> — 4> _l 4>'J F- It is evident here that if the 
coefficient of X is not originally I then no such change of variable has 
the effect of reducing it to 7. In the subsequent work it is desirable 
to transform A{x) into the matrix R(x) given by R(x) = ($</7>(*))• 
By such a transformation the formal solutions (20) are carried into 
others with the same functions 7<(ar) in the exponents. 

Since the 4>(ar) in question must satisfy <t> -1 A$ s R , 7 it must fulfill 
the conditions 

(i) A 4> ■ <t>R, 

(ii) l*(*)|4=o. 

To satisfy condition (i) the elements of $ must be solutions of the 
algebraic system 

n 

2 Oik *Pki = <Pii 7 h h h = 1 , 2 ,... n, 

4-1 

while the possibility of fulfilling.this condition by means of a matrix 
4>(x), none of whose columns contain only vanishing elements, depends 
upon the existence of a solution for each of the n linear systems 

7 For fuller discussion see, for instance, Bocher, loc. cit. Chap. XXI. 
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n 

(23) 2 {a ik — bit y, % \ «p*, # = 0, 

obtained by giving to jo successively the values 1,2, ...n. This 
means that each of the determinants | a ik — 8 ik y, o |, j 0 = 1, 2,.. .n 
must vanish, and inasmuch as the functions 7,Or) were chosen as the 
roots of equation (22), which is precisely the condition that the de¬ 
terminants in question do vanish, the existence of a matrix <f>(z) of the 
type desired is established. 

Condition (ii) requires further, however, that this matrix <£ be one 
for which | 4»(ar) | 4= 0. 

Suppose 1 4>(ar) | = 0. Then the system of linear equations 

n 

2 <p{ k v k = 0 admits of solution by means of a set of quantities V\, 

k -1 

®2> • • • v nt at least one of which differs from zero, 8 i.e. there exists a vector 
V • ^ 0 , such that 4 > K* — 0. But then A&V • = 0, and in view of the 
relation (i) it follows further that *RV• =0. By continued repeti¬ 
tion of this reasoning it can be shown likewise that <PR k V -s 0 for all 
values of k. Consequently 4>{c 0 / + CiR+c 2 IP +. .. -f c„_ 1 R n ~ l \V-zsO. 
But on the hypothesis that 7i 4= 7 i when j 4= i it is well known that 
the determinant 

1 7i 7i* • • .71*” 1 

1 72 72* ...72 n - 1 

. 4=0. 

1 7n 7n 2 . . .7n n ‘ l 
Hence the system of equations 

C0+C171 + C27, 2 +. .. +c n -1 7 1 n_1 = hi 


Co+ Cl 7n -f C 27 „ 2 + . . . 4- C n _l 7i n_I = k n , 

admits of a solution in the c’s not all zero for any choice of the quanti¬ 
fies ki not all zero. Some choice of the set c 0 , Cj. . .c n _ u can therefore 
always be made to satisfy the relation 

_ !co/ +ci/?+... + c„_, R n ‘ l \ a 

8 Cf. B6cher, loc. cit., pp. 47. 
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Then *(«., *,) V‘ = 0, 

n 

i.e. 2 •PiikiTt = 0, for all choices of the set k x , k it .. k n . But by 
i-i 

construction some v, say f/ # is not zero. If then the k 's are chosen so 
that ki = 0, i jo, k, 0 4= 0, it follows that v></ # = 0 for all t. Inasmuch 
as no column of 4> contains only vanishing elements this involves a 
contradiction. Hence the hypothesis 14> (x) | = 0 is not tenable, and 
the 4> in question fulfills condition (ii), i.e. 14>(x) | 0. 

By a change of variable, then, equation (19) can be given the form 
P'.= \R\ + B\ K•, where the matrix B, being given by the relation 
B — <J>-‘ ZM> — 4»- , <l> / , is a matrix of continuous functions. Supposing 
this to have been done we may drop the dashes and consider the equa¬ 
tion in the form 

(24) r(x). = |/?(x)X + *(x)| y(x)-. 

Since we have found that the Y{x)- of (20) may be any one of the n 
vectors y»(x)- obtained by replacing T(x) by I\(x) the further dis¬ 
cussion might well be carried through for each of these vectors indi¬ 
vidually. However, if the matrix E(x) is defined by the relation 

E(x) = (bn e xr > (,) ), 

it is readily seen that the matrices P,(x) can be chosen so that the j lh 
column of the matrix 

(25) Y(x) m { i>,( x) + i P,(z) +... j E(x) 

is precisely the general column of the vector y,(x)* Hence all cases 
are simultaneously treated by the consideration of those formal solu¬ 
tions of the matrix equation 

(26) r(*)- U(x)X + £(x) 1 y(x) 
which have the form (25). 

Inasmuch as E'(x) = (X7,(x) e‘ T < w ) = X ft(x) E(x), the formal 
substitution of (25) in (26) gives the identity 
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(27) jPo+^P.+ ^PH--- j W+ { P! > + { P ' l+ ■] E = 

jxfi + sj jPo+ip,+ ... J E. 
Equating the coefficients of X we obtain the relation 

Pofl = RP 0 , 

i.e. Vi?yi = 7<J#\ 

whence it is seen that p!** = 0, when j 4= i. Equating the coefficients 
of X° we obtain in similar manner the equation 

PxR + P 0 # - RPx+ BPo, 


i.e. 

(28) Pi? 7, + Pi?' = 7 i p\? + 2^ bik Pkj • 

whence it is seen, upon setting t = j that 

PT- Zbutf-bHtff, 




namely, that pf? = /, e € 

Having obtained in this manner all the elements of the matrix Po 
we find further from the equation (28) for t 4= j that 

7/ 7< 


(29) 


^<i> - 
Pa - 


Moreover, equating the coefficients of - we see that 

PzR 4* P'i = *P*4- 


i.e. 


?S ) l7i— 7») + = 2 bn* Pkj , 

4—1 


whence it follows upon setting i = j that 
(30) 


pjr = 2 bn pkj. 

4—1 
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Since the other quantities under the sign of summation in this formula 
have all been determined, py}' may be found by means of a quadra¬ 
ture. When this has been done the matrix P\ has been completely 
determined. 

It should be observed that we have thus far required only that the 
matrices R(x) and B(x) be continuous. To determine the elements of 
P 2 however, we have first, for t 4= j, the equation 


P g) = ~ pi! 1 ' + ' 

7r~ 7i 

and since this formula implies the existence of pJJ*', whereas it is seen 
from (29) that p$ has a derivative only if this is true of the matrices 
R(x) and B(x), we cannot proceed to the determination of P* if R(x) 
and B(x) are merely continuous. 

Let us suppose then that R(x) and B(x) both possess continuous 
derivatives up to and including those of order k 1, but that perhaps 
one of these matrices possesses no such derivative of order (k + 1). 
If, in particular R(x) and B(x) possess infinitely many derivatives we 
may take k = 00 . The derivatives of Pi up to and including that of 
order k are now seen to exist from formulas (29) and (30). 


Equating formally the coefficients of in the identity (27) we 

A 


have 


+ iV . - + bp.-i 


p\f \y>- 7.) = - / 4 r"' + 2 


k -1 


whence 


- P \r y + 2 b ikP <tr'> 

ptf --- for « 4= j, 


7i - 7i 


and again, equating the coefficients of — we have 

A 

P,.X R + K= RP h +i + BP „ 
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i.e. It/- Y.l + = 4 2 6.-* p#, 

whence it is seen, upon setting i = j that 

vT = l b ikP ' k f, 


k -1 


namely that pjf can be determined by means of a quadrature if the 
quantities on the right are known. 

The determination of the elements of p)f by means of these formulas 
depends, therefore, only’ upon a knowledge of the elements of P M -i 
and upon the existence of P'-i. Moreover, it is seen that in general 
P possesses one less derivative than P M _i. Inasmuch as Pi has 
already been determined, and was seer> to possess k derivatives it is 
clear that the matrices P M for /i = 0, + may be suc¬ 

cessively determined, and that in general the matrix P *+1 is merely 
continuous. 

If k is finite, we can, therefore, determine a differentiable matrix 
S(x), given by 

(31) S(x) = | Po(x) + £ Pi(i) +■•• + “* Pk(x) | E(x) 

which will satisfy the equation 

S'(x) = IXR + B\ S(x) + ^ I P k (x) - B{x) P k (x)) E(x), 

i.e. an equation of the form 

(32) S'(x) = j Xfi(x) + B(x) + ±- k L{x, X) J S(x), 

where L(x, X) is a matrix each element of which is rational in X, 
with coefficients continuous in x, given by power series in 

If on the other hand k = <», as many terms of the infinite series 

(0) , Pi? , Pi? , 

Pii ~ Pii + T + ^ + — 
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as are desired may be used. These series are not in general con¬ 
vergent. Nevertheless the formal matrix 


S(x) 


j PM +1 PM + | FM 

jp‘° , + T + ' 0 


which is found in this case, formally satisfies the equation (26). 

With the convention that the term in is to be omitted if k = co 

A 

formula (32) holds in all cases. S(x) will be called a formal matrix 
solution of equation (26) regardless of whether k is, for the case in hand, 
finite or infinite. 

It should be observed that each is not wholly determined but 
contains a single arbitrary constant of integration, independent of x. 
This arbitrariness corresponds to the fact that any convergent power 


senes 


ies cj in (|) 


with constant coefficients may be multiplied into each 


column of S(x) without thereby destroying its property of being a 
formal solution in the sense (32). 

Using the notation (^ 0 ]*» or (^o), for an expression of the form 

. , h , ,*»,*(*.*) 

^°+ 4 --* + ^ 

where \J/ is bounded for | X | large, we have 


Six) = 


It is clear that an alternative form is 
(33) S(i) = (c, 

X 

where Bj(x) = fbiidx. 

a 

By similar considerations the equation Z' = — Z\A\ + B\ may 
be transformed and a formal matrix solution T(x) for the resulting 
equation 
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(34) 


Z\x) = - Z(x) I R(x)\ + B(x) | 


be obtained. This T(x) has the form 

(35) T(x) = (c< e -xr.U)-®,u) [j,.] , 


and satisfies an equation of the form 

T'(x) = - T(x) | XB(x) + B(x) + 

Moreover each row is a formal solution of the vector equation 


A Mix, X) 

A 


(36) •£'(*) “ “ -Z(x) W*) X -I- £(*))• 

Considering differentiation as merely a formal process defined by 
the usual rules it is readily seen that the differentiation of the formal 
matrices S(x ) and T(x) is permissible. Hence we have 


— TS= TS' + T'S 

dx 

= r jxB + B + ^ij s-r jxR + B + i t Mjs 
- Kt\L- M)S. 

A 


Since 


TS = y 2^ c% e 15, J * eg e 15*,J * J 

= (c, c, e k I ****** WWW j «« + ■V +■•••+ w j )' 
we find upon differentiating, and removing the exponential factor, 


Ci Cg | {X(t,—7<) + bjj—bu) 1 5</+ j | 
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K 2 [&ik\k Uht — ™A*I 
X* A.*-» 

Equating the coefficients of X° we have 

Ci Cj { (7,— 7.) * 5 ? + (&yy M toy I 55 °* 


from which it follows that = 0 when i 4= j • Again, equating the 
coefficients of - we have 

A 

(7y— yi) °?j + (toy to<) *$+ 

from which it is seen, upon setting i = j that ffJJ*' = 0, namely that 
sa constant. The relation shows on the other hand that aj* = 0 
when t 4= j- Equating to zero successively the coefficient of each 

individual power of - it is found in the same way that a\f = 0 when 

X 

i 4= j , a]f = constant for n = 1,2,.. .,(k — 1). It follows that 

TS = («M |^j I+ * + ... + £} + j ) 

= (c/ Cj Sa E~ l (*«) £» 


where the coefficients of [ll*_i are constants. 

Now for any choice of the set of series c,- it is clearly possible to 
choose a set c> such that c/C/[l]*_i = 1, j = 1, 2,.. .n. The formal 
solutions S(x) and T(x) corresponding respectively to these values of 
Cj and ?/ are closely related. They are called associated formal solu¬ 
tions and satisfy the relation T(z) S(x) = I + &~ l (*</) 

Since in particular c,- may be chosen as c,- = 1 it is seen that there 
exists a formal solution having the form 

(37) S(a?) = 


In accordance with the definition above the associated solution is 
given by the set of cf s which satisfy the relations c, = 1. It is 
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apparent, therefore, that the solution associated with (37) is of the 
form 


(38) 


T(x) = (*-*r *(*)-*><«> [5,-/1)- 


Section VI. 

The relation of the formal solutions to the actual solutions. 

It was observed in the preceding section that the formal matrix 
S(x) either satisfies equation (2G) only in an approximate sense (i.e. 
satisfies (32)) or, if k = <*> satisfies it only formally, since the elements 
of S(x) are in that case infinite series which are not -in general con¬ 
vergent. S(x) is, therefore, not a matrix solution of equation (26), and 
its significance requires further investigation. 

Consider the actual matrix which is derived from S(x) by retaining 
in the latter only the first (m + 1) terms of its elements, where m is any 
positive integer not exceeding k. This matrix S(x) may be written, 
in accordance with formula (37), 

S(x) = (r^>(x).a,(x)[ 5 ..] m) 

« P(x, X) £(*), 
where P(*,X) = (**><'> [*./)«). 

and is seen to be analytic in X. Since | e fl > (z) 5,y | 4= 0, it follows that 
| S | 4: 0 for | X | > N. If the formal solution S(ar) is written as 
P(x, X) E(x), it is apparent that we have the formulas 


S(x) = 

\ I - (<?] r-' 

\ Six), 

Six) = 

S' +i 

Six), 


from which we obtain upon differentiating and substituting from the 
equation (32), the relation 
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1 


/ + ^Tn (Q) P-' \ SW. 


I 


It is apparent from this that S(x) is a solution of a homogeneous 
differential matrix equation of the type 


(39) 


S'{x) = | \R + B + ^ *(*. X) J S(x). 


The elements of 4>(x, X) are power series in with coefficients 

which are continuous in x, and are seen to be convergent, since these 
elements are rational in X. 

But equation (26) can be written in the form 

(40) Y'(x) = |b(i)X + B(x)+^4>(x,X)| Y(x) Y(x), 

and considering this as a non-homogeneous equation we know, in 
virtue of the developments of page 62, that its solutions, i.e. the solu¬ 
tions of (26), are given by 


y(*) = S(x)C+ Six) T(t) 


js(x) f (0 j - ± +(». X) Y(t) j dt, 


i.e. by 

(41) Y(x) -S(x)C- ± J S(x) TO) *0, X) 7(0 

where T(x) ■ S~ l (x) and where the lower limit of integration, which 
has been omitted, may be chosen at pleasure for each of as many parts 
of the integrand as desired. 

Substituting for Y{x) in equation (41) its equivalent as given by 
the form 

( 42 ) Y(x) = U{x) S(*), 


we have further 
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(43) 


U(x) = Six) CT(x) - JL JS{x) T(t)<Ht, X) U(t)S(t) T(x) dt. 


Theorem : If the functions 7»(*)> * = 1, 2,... n, satisfy the relations 
arg I Vj(x) y<(ar) | = h iit i, j, = 1, 2,.. .n, where each is a con¬ 
stant, then there corresponds to each sector bounded by two adjacent 
rays ” 7 * 0 * 0 )) = 0 8 a choice of the lower limits of integra¬ 

tion which is such that for X within the sector and | X | > N, and for 
any continuous matrix U(t), each element of the matrix 

z 

*(*, X) a j S(x) f(t) 4>«, X) Uit) Sit) fix) dt 

is less numerically than KM, where M is the largest numerical maxi¬ 
mum attained by any element of V. 

Proof: Writing 

Six) =» (*</(*)) E(x), T(x) - £-'(*) (U ,(*)), 


we have 

f(x, X) = ( Z f (x)e X,PA(,) r * (<), <Ai(0 •Pipit, X) u PQ (t) 
\ h. I. p. Q. r—I J 


a,,it) e M «■-<« - r,(») | t rl ix) dtj 

-i(Jf'*’-'** .v 

where, for | X | > N, \ w ‘J/ 1 | < for all t and j, k (h,) being a 

positive constant. 

Consider a particular sector and any element 


z 

/ 


x/ { y h (0-y r (t)\dt 


oj 


ihr) 

V 


dt. 


if fllx{ 7 »(f) — 7r©l ^ 0 for X within the sector and any £ it is so 
for all £, and, provided that t ^ x, the integral 


• The notation R {*>) is used to indicate “ the real part of f.” 
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is less numerically than k {hr) M. However, if /flX| 7 *(£) — 7r($)| I > 0 
for X within the sector and any £, it is so for all $, and, provided that 
t 2^. x, the integral 


X 


x/|y 4 (f) -y r ( t ) |dt 


to 


%'dt 


is similarly bounded. Consequently if K hr (\) is defined by the rela¬ 
tions 

* Ar = a if rtlxbiitt) - 7f(l)l I S o, *», = b otherwise, 
the numerical value of each element of the matrix 




( i fe'!'"-'' 

V *' 


-y,U)\di 


to 



n 

is clearly less than KM, where K = (6 — a) 2 & <Ar) , for X within the 

sector and | X | > A’. Q* D. 

Assuming the limits chosen in the manner above, a i(x, X) corre¬ 
sponding to each sector and to each U is uniquely determined. More¬ 
over, we have 

(44) U(x) - S(x) CT(x) - “ *(x, X). 

A 


Consider now the particular solution. )V.r), of equation (20) which 
satisfies the relation )*<>(«) = S(a). Since S(x) as well as the coefficients 
of equation (20) arc analytic in X, I'oU) is likewise analytic in X. 
Moreover we know from page 50 that every solution of the equation is 
of the form }'(•**) = ) •(■*■) D t where the elements of I) are constants 
with respect to .r. Substituting this form of Y(x) into equation (41) 
and fixing x and solving for C we obtain the relation 


c 



I'oCro) + £ 


i 

-j s 


SM nt) 4>(f, X) r o (0 (It D, 
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i.e. 


Cij = 2 p ih d hj , 


h -1 


where the quantities p,v are analytic in X. 

Inasmuch as y(z) is a non-identically zero solution if D =£ 0 there 
will exist such a solution for which C = 0 provided the determinant 
| Pih | vanishes. If on the other hand this determinant does not 
vanish then there exists a solution Y(x) ^ 0 corresponding to every 
C 4= 0. 

Suppose the determinant | p ih \ = 0 . Then there corresponds to the 
choice C = 0 a solution T(x) for which the matrix U(x) defined by 
Y( x ) = U(x) S(x) satisfies the relation 


U(x) 


-Mif" 

A 'A.r-1 „ 

K k.r 




0,$') dt) 


But we know that for some x t say x 0 , and for some t, j, say to, jo, 
Ui^(xo) = 3/. Then since 

I £ <ku. 

X.r-1 r. 


.f 


we have for the to, jo element, M < 


KM 


i.e. 




Inasmuch as M is positive and X can be taken arbitrarily large ti.is 
involves a contradiction, and proves the hypothesis untenable. 
Hence | p,a | 4 1 0. Accordingly there corresponds to every choice of 
C ^ 0, and hence to the particular choice C = I, a unique 1) 0 , and 
hence a solution Y(x). By equation (44) the U(x) for this >'(*) must 
satisfy 

V(x) = / - -4 #(*. X) . 


We infer from this the inequality, 
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whence it follows that for X sufficiently large, M is less that 2 and 
yp{x, X) is uniformly bounded, for X in the sector in question and further 
| X | >N. Accordingly we have 

r« = («„ +^r^) (*«(*)) E(x). 

i.e. V(x) - (*„(*) E(x). 


Since the matrices K 0 (*) and D are analytic in X the same is true also 
of Y(x). It is seen, moreover, that | Y | 4= 0. Hence, when the 
functions y »(*) satisfy the conditions of the theorem on page 83 there 
exists in every sector of the type described above a matrix solution 
Y(x) whose elements are continuous in x and analytic in X, and are 

the same as those of S(x) to terms in . But by construction the 

A 

elements of S(ar) are the same as those of S(x) to terms in . Hence 


we have the 

Theorem: Given any formal solution S(x) of equation (26) in which 
R(x) and B(x) both possess derivatives up to and including those of 
order Ic, then in each sector of the complex plane within which none of 
the quantities RMyn(x) - 7rto11 change sign there exists an actual 
matrix solution F(x) which is continuous with its first derivative in x 
and analytic in X, and is throughout the sector identical with S(x) to 


terms in 

X 


In virtue of this there exists in any sector of the type described a 
pair of actual associated matrix solutions of the form 


t y(i) = («*<-,«♦»,•<*) [«,.,) k-l) 

( Z(x) = (e" xr »(*) _fl »(*) , 


provided only that the matrices R(x) and B(x) are differentiable 
k terms. In all cases the functions [$</]*-, which occur in the expres¬ 
sions for I'(j) and Z(.r) are, of course, respectively identical, to terms 
in l/X* -1 with the series [d„l in the formulas (37) and (38). 
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In the course of the deduction of forms (45) it was assumed through¬ 
out that n ^ 2. Direct integration of the equation, however, shows 
that in the case n = l there exist solutions which are of this form over 
the entire plane. Accordingly formulas (45) may be used in every 
case. The explicit forms 

1 > 

Z(x) = (r*r<W-®.W j j), 

with <py, hj hounded, hold if R(x), B(x) are differentiable. 


Y(x) = | Sii + X) . 


Section VII. 

The characteristic values of the system 

Y'(x)-{R(x)\+B(x)}Y(x)- 
W m Y(a)- + W b Y(b)--0. 

It was shown on page 65 that a necessary and sufficient condition 
that the vector system 

r- = \r\ + b\ y. 

(46) W a y(a). + w b y(6). =0 (I w a | * o, | w b \ * 0), 

has a solution is that 

(47) I W m Y(a) + W b Y(b) | = 0, 

Y{x) being any matrix solution of equation (26). Let us make the 
specific form of this condition in any sector of the X plane apparent by 
substituting a F(x) which is analytic in X for | X | > N, and which has 
within the sector the form 

(48) Y(x) = ( e * r ,*»♦*,« [«,,]), 

as determined in the preceding sections. Choosing the a of formula 
(20) as a = a it follows that y(a) = (t5*/l). 

Substituting from (48) in the determinant on the left of equation 
(47), (call it D{\)) we have 
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D(X) = | W a ([5,,]) + W h (c xr > ib) + B > <b %,)) |, 
i.e. 

(49) D(\) = | hr*?] + k'4’1 

In this as in subsequent formulas when the expressions in question 
are determinants the letters r and r rather than i and j arc used to 
indicate row and column respectively. 

Due to the fact that those and only those values of X which satisfy 
equation (47) are characteristic values, i.e. values which yield solu¬ 
tions of system (40), equation (47) is known as the characteristic 
equation of the system in question. 

The introduction at this point of quantities 5 |; and will do much 
toward simplifying the further discussion. These quantities are de¬ 
fined by the relations 

( = when R{\Ti(b)\ gO =0 when R\\r,Q>)\ g 0 

■ 

( = 0 when R\\Ti(h)\ >0, = hi when R\\T,(b)\ > 0. 

It is to be noted that and are functions of arg X alone. 

Suppose now that the rays 9 /flxr,(6)| =0, t = l,2,...n, are 
drawn in the plane of the parameter X and let one of the sectors 
bounded by a pair of adjacent rays of this kind, R\\V k(b)\ = 0, and 

/?|Xr, (6)| = 0, be denoted by a u . Then if 6^ and 6 ( £ z> are respec¬ 
tively the 6 * and 6*; for some (any) particular value of X within <r u , 

we have the identities 6*- = 6*} = 6 {l $ t for all X in the interior of 

the sector in question. It becomes apparent in virtue of the relations 

W£] when R[\T e (b)\ < 0 

(50) [*«1 + [*%] + B f (6, = [ ^ )] cXre(6) +flf(6) 

when /{|xr e (6)| > 0 , 

and the fact that /t|XI\(6)| 4= 0, t = 1, 2,. . .n, that D{\) takes, for 
any X in the interior of such a sector the form 

9 A half-line issuing from the origin of the X plane will be referred to as a ray. 
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(51) 


D(X) = I [«#) 5* c + ) a;; +*'<*> |. 


Factoring from the determinant the exponential factors, any one of 
which occurs in each element of an entire column, the further alterna¬ 
tive form 


(52) 


d(x) = n c 1 kr ‘ (W +B ‘ (W 14 “ 11 C + luffl C 


is obtained. It becomes necessary at this point to differentiate be¬ 
tween certain types of conditions which inherently characterize any 
particular system of type (46). 

The conditions of the system will be said to be regular (i) if n = 1, 
or if, when n ^ 2, arg r,(6) =£ arg i =*= I\(6)| when j 4= t, and (ii) the 
boundary conditions are such that each of the determinants 


(53) 


If 


r(*/) 


„,<o) *<*/) 

w re b ee 


+ 


re 


Akl) 

6 ee 


differs from zero. 

The conditions of the system will be called irregular if either of these 
conditions is not satisfied. In the further discussion we shall consider 
first the case of regular conditions and then the particular type of 
irregular case which results from dropping the regularity condition (i) 
above. 

Case I. Regular Conditions. In this case the rays /?|XI\(5)| = 0, 
i = 1, 2, . . .n, are distinct and divide the plane into 2 n sectors of 
type °ki’ Let us fix the attention upon any one of these rays, say 
the ray /flXI\(6)| = 0, and let <r and denote the abutting sectors, 
the former being that within which /?|XI\(6)| < 0. For X on the 
ray R{XT y (b)\ = 0 the * tk column of Z)(X) consists of terms whose 
form cannot be abbreviated by means of relations (50). Retaining 
therefore the original expressions for the elements of this column we 
have 

D(X) = | [«#) + [.r?2l latf + 5 C ,| c xr « (6) +Se(M I . 


For X either within or within <J pr , on the other hand, the form of 
D(X) is given by formula (51), the difference in the value of D(\) in 
these sectors being accounted for by the difference in the values of 
Sec and s *c, c = 1, 2,..., 71. 
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But it is readily seen that 

(54) a ( L r) = & - = S ( £ + 3... 

Hence if S is any sector which includes the ray Jt|XI\(&)| = 0 and 
overlaps a part of each of the sectors <r |ir and we have, for a X in S, 
the following expressions 

D(\) = | [«.%! a;; + M5I s';;' e x w)+»« , »>|, for x within 

D(X) = | (»%] a'” 1 + MJ>] la'"’ + |J«*r.»+*«| i for X on the 

ray II |XI\(t)| = 0, 

B(X) = I M2] la'"' - 8 e ,| + Mill la'-;’ + a e .| r^W + ^l.forX 

within <r, rt 

It is readily seen from this that D(\) is given for any X in S by the 
formula 

(55) D(X) = | hr'?] 8<{” + [.r^'llaif* + i«.M r ' (w + 


• • 


Factoring from this determinant the product II e l*r*(M+ a*<&)| 

k-l 

we have 

(56) D(\) = n^lr 4( 6)+fl*(6)|« ( K> d{\), 

k-i 

where 

B(X) = I [wif] «S?> + + U ^«*.(W+a,( 6 ) |. 


Now D(\) is seen to have elements consisting of a single term in every 
column but the v*, the elements of that column being binomial. Con¬ 
sequently the expression of the determinant as the sum of two others 
is possible, i.e. 

D(\ = | MM a‘f’ + MM «£’ I +1 MMUiM - U + 

M?H*J? +a.,)k r '»>+*'<»> 

or, in view of relations (53) and (54), 


(57) 


Z>(X) = [W> 9 ] + [BH e r,x(6H*,<6> # 
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Since the roots of D(\) =0 and those of D(\) = 0 are the same we 
have, therefore, as the characteristic equation 


(58) [W M ] + [H' <rr) J = 0. 

This yields, since W {tAV) 4= 0, the equation 

T u/O 1 *)! iitO**’) 

xr ^b)*BAb) _ _ __ _ ___ 

e - [ ifmJ - jr<"> 


+ 


where < is here introduced as a generic symbol for functions which 
approach the limit zero uniformly as | X | increases beyond limit. 

Solving for X, and observing that log (K + «) = log K + we see, 
therefore, that every characteristic value which lies in sector S is of 
the form 

(59) Xp = r^6) I " B ' (6) + log +« ( x p) + 2 p * j - 

where p is a positive integer. Moreover, since y(x) is analytic in X 
throughout the sector S, | X | > A’, the same is readily seen to be true 
of « also. 

Consider now a small circle of fixed radius r drawn about the point 

— ) 

— B.{b) + log —+2 pmy 

for any given p. Then for a proper choice of origin in the X plane 
(see page 98) this circle lies entirely within the sector or tr„ T , and 

< r for | X | > N. Also if p is sufficiently large the point 

1 ( ) 

— j - B.(b) + log-^ + ‘ + 2 P™ 5 




lies within the circle for all X on the circumference. Consequently 

arg i x - r^6) i - B ' (6)+108 »■'"> + * +2pvi \ i 

increases by 27r as X describes this circumference, and just one root of 
equation (59) is accordingly seen to lie within the circle. Since it is 
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readily verified that every such root is a characteristic value it is seen 
that (59) determines such a value for every p which is sufficiently 
large, i.e. that for large values of X the characteristic values lie ap¬ 
proximately along a line parallel to the rav /f|Xr,(/;)} = 0, the dis¬ 
tance between two adjacent ones approaching as a limit the finite 
length 2 jt/ | I \(h) |, as | X | increases indefinitely. From the deriva¬ 
tion of this result it is seen moreover, that a similar sequence of 
characteristic values lies near each ray li\\r t (b)\ = 0, and that no 
further distribution of characteristic values exists. 

Case II. A Type of Irregular Conditions. 10 I>et us suppose now 
that the regularity condition (ii) is fulfilled, that n ^ 2, but that 

(60) arg r,(6) = arg I =*= I\(6)| 

for the pair of values i - v u j = ^ We have then a case of irregular 
conditions, and while we shall restrict the discussion to the case when 
the relation (<»0) holds for only a single set of values i, j, the reasoning 
to be employed is typical and may be applied with equal success to the 
cases in which a greater number of the points I\(6) arc collinear with 
the origin of the complex plane. It is only for the sake of brevity that 
the simplest, rather than the most general case which results from 
dropping the regularity condition (i) is treated. 

A review of the discussion applied to the case of regular conditions 
readily shows that the methods employed there apply equally well to 
the case in hand and yield the same results in any sector of the X plane 
which does not contain the line /f|xr ri (6)| = 0. It is, therefore, 
necessary to consider here only the distribution of characteristic 
values in a sector 6* containing this line. Along the line in question 
neither the expressions for the elements of the •'j* nor of the ^5 column 
of D{\) can be contracted by means of the relations (50). Accord¬ 
ingly it is found that D(\) takes the form 

(Gi) D(\) = 11*#] + [»S?1 Ia£? + ^ < xr ‘ (6) +a<(6) I 

• •• 

throughout the entire sector S, and representing the quantities 
i« and 6" for X on the ray fl|XI\,(6)| = 0 which lies in S. Factoring 


10 For the discussion of a differential system representing a different type of 
irregular conditions see Hopkins, loc. cit. 
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from (61) the product n xr * (6) + s * (6) 1 6 kl we have 

D(X)- n e l xr * <w+B * (M l*« 3 (x) , 


(62) 

where 




m =i [v>w:i + [»^i {«2+«... * xr '' <6)+fl ' i<b) 

+ «..,« xr '* <k>+B ' ,(6) H- 

The determinant Z)(X) is seen, therefore, to have monomial elements 
in each column except the v\ h and the *4\ the elements in these 
columns being binomials. It is clear, therefore, that Z)(X) can be 
expressed as the sum of four determinants each containing only 
monomial elements, i.e. 

(63) 2>(X) - I [4?] «S? + [4?1 «£ I + 114?] M? -«... I 

+ k«]U« + ic.ll « xr ' ,<kHB ' , <‘> + 

I M?1US? - ic.,1 + (4?!U« 4- ««.,! |e xr '» lt)+s -' (M + 

I (4?]I*«* - «... - ic.,1 

+ [»S?]I«5 + «... + «»,l |c x l r„(i').r„,6, |*s, 1 (buB„(i) _ 

Let us again denote by j and a„ the sectors abutting on the ray 
R {Xr yi (6)) = 0 in which R\W mi (b)\ < 0 and > 0 respectively. It is 
necessary to consider the case in which arg r„ (6) = arg I\ f (6) and 
that in which arg 1^(6) = arg { — I\ f (6)|. 

Sub-case A. arg T ri (6) = arg r, t (6). 

In this case the quantities R\\r„ i (b)\ and R{\r„ t (b)\ are of the same 
sign throughout the sector S, and the relations 


(64) 


dee 
• • 

:<*r) 


+ 3*. + 6 Cr . = Si? = 5 


• 

(4 

cc 

• • 

c*) 


*ie - be.. - = dee = 3. 


’Cr 


ee 


'cc 

• • 

?(#»»’) 

'cc 
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are readily verified. Defining the determinants W\ and W r by the 
formulas 

h’, = i «<? i i*£>+«,.,i i 

(65) • •• 

= I «&’ 1C + « e .,l + «{? IC - *«.,! I 

we have Z)(X) = D(X) where, in view of (64), D(X) satisfies the equa¬ 
tion (56) and is given by 

(66) 25(X) = lll < " ) ) + III',) e MV.(»>*».,<*> + [H/j 

Sub-case B. arg r r ,( 6 ) = arg I — r, a ( 6 )|. 

In this case il|xr r| ( 6 )) has throughout S the sign opposite to that of 
R{\r^(b)\, and it is found that 

V 67 ) .. .. 

*<"> t — t< r > _ !<»•»’) t 

6 ee — d C r, — 6 ee - 0 ee — 0 e „ t . 

Accordingly we have 

25(X) - [»’,] + + |ir ( ")]e kr » ( '’)* B -<‘> 

If now we define Z)(X) by the relation 

5(X) e - x «v6>-B„<*> = 5(X), 

it is readily verified on the basis of formulas (67) that D (X) again 
satisfies equation (56) while it is given in this case by 

( 68 ) D(X) = [W M ] + (H',1 e xr '« (6)+B '. (6) 4 . [H' 2 )e“ xr ^ <B) ” a « w 

-f [\V {wr 

Let us suppose now that the notation has been so chosen that 

r r ( 6 ) 

| I\ ( 6 ) | ^ | T.( 6 ) |, and set —-— = r. Then upon dividing 

T^( 6 ) 

equations ( 66 ) and ( 68 ) through by (recall that W {lu * 4= 0 by 


>i f M rr,( 6 >-r^(t*) . 
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hypothesis) it is seen that the characteristic equation, i.e. D(\) = 0, 
is in each case of the form 


(69) 


1 + [ Cl ]e xr '* (6) + [c 2 ] e rXpr,<w + M e l 1+, } xr *'i (6) = o. 


where c„ i = 1, 2, 3, are complex constants, c 3 0 by hypothesis, and 
r is a real constant r ^ 1. 

Wilder 11 has shown that the roots of this equation are asymptotically 
represented by those of the equation 

(70) 1 + ci * xr " (6) + c, e rXr '* (6) + c* e^ r ) xr '-< 6 > = 0, 


and has discussed the distribution of the roots of this equation. We 
shall proceed to this discussion, observing, however, first that when 
| r„ (6) | and | I\ ( (6) | are commensurable a far simpler treatment is 

V 

possible. In that case r = - where p and q are integers and the 

xr„<&) 

equation (70) is an algebraic equation of degree (j> + q) \n e ~ 
Accordingly it has (p + q) roots, i.e. 
x r„<6) 

« Q = j = 1 , 2 ,..., (p + q) 


from which it follows that 


(71) 


X* 


r„(&) 


I log a, + 2kwi) . 


In this case, therefore, the characteristic values which lie in sector S 
for | X | > A r are asymptotically represented by a set of points which are 

2qir 

spaced at intervals of length |p ~ ^ ~ | on (P + *?) lines (not neces¬ 
sarily distinct) parallel to the line 7l{Xr r| (6)} = 0. 


When I\,(6) and I\ t (6) are incommensurable no such simple treat¬ 
ment is possible. The distribution of the characteristic values may 
be obtained, however, by Wilder’s procedure, 12 which follows. 
Setting XI\,(6) = z = x -+- iy we have as the equation (70) 


m = 1 + C,e' + + c, e l ,+ M* = 0, 


11 Wilder, loc. cit., p. 423. 

12 Cf. Wilder, loc. cit., pp. 420-422. 
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and it is readily verified that there corresponds to each choice of an 
arbitrarily small positive constant x» some value of x, say x = X, 
such that 


(72) 


(i) |1 -/(-*) I < X, 

00 I c » — I < X. for * S X. 


We shall assume that x is chosen sufficiently small to preclude the 
vanishing of /(z) outside or on the boundary of the region | x | ^ X. 

Now /(z) is analytic throughout the entire finite plane, and hence 
it is possible to find in any interval of the Y axis, however small, some 
point y — yo which is such that the line y = */o contains no zero of/(z). 
Let y = Yi and y = Yt be any two such lines and consider the rect¬ 
angle K bounded by them and the lines x = A' and x = — X. We 
shall determine the number of zeros of /(z) within K by observing the 
increase in arg/(z) as z describes the perimeter. 

We have arg/(z) = sin" 1 where 7l/(z)| denotes the coeffi¬ 

cient of V—1 in the expression for /(z). Moreover, for y = constant 
11 f(z )) has the form 


win 


where the coefficients, d it i = 1,2, 
zeros of /l/(z)l> and hence those of 


3 are real constants. The finite 
\ ^ v‘ # being the roots of the 

I/to I 


equation 




are, however, separated by the finite zeros of the derivative of the left- 
hand member, namely by the roots of the equation 


d 2 \r — 1] + d*Te z = 0. 

Since this equation is satisfied by at most one value of x it follows that 
vanishes at most twice, and consequently that arg/(z) changes 

I /to I 
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by less than Ztt as x varies between the limits x = —X and x = -f- X 
along a line y = constant. 

Because of the relation (72i), however, we know that for every a 
on line x = —X,f(z) lies within a circle of radius x about the point 
z =l, and hence that as z moves along this line arg/(z) changes by 
less than 2 sin" 1 x- Similarly relation (72ii) shows that arg |/(z)/e (l+>) * | 
changes by less than 2 sin -1 x as z moves along the line x = X. From 
the identity 

arg/(z) = argel l+r l* + arg 

it follows, therefore, that the increase in arg/(z) as z moves along the 
line x = X from y = Yi to y = Y 2 lies between (l+r)( Y 2 — Fil + 
2 sin" 1 x and (1 + r) | Y 2 — Y i| — 2 sin -1 x- Consequently the in¬ 
crease in arg/(z) as z describes the perimeter of K lies between 

( 1 -fr) (F*—I'll + 6ir+4 sin -1 xand (1+r) { Y 2 — Fil — 6 tt— 4 sin _, x» 
and accordingly the number of zeros located in the interior of K must 
lie between 

(l+r)lF,-Fi| ...2 


+3+ - sin-* x 


and 


2ir 7T 


d+r) |y,-y,i „ 2 . , 

- -3-sin-’ x 

2 TT TC 


Since x ca n ^e chosen arbitrarily small, however, this means that the 
number of characteristic values between any two lines y = Ci and y = 

Ci-|- / is at least ^—- / — 3, and cannot on the other hand exceed 


2tt 


1 +r 

2?r 


/ + 3. 


Summarizing the results it is seen, therefore, that the characteristic 
values in sector S lie in a strip bounded by two parallels to the line 
(b) | = 0, and that they are so distributed throughout this strip 
that for | X | > N no more than three lie between any two lines which 

* 2 fl¬ 
are at a distance d < - from each other and are perpendicular 

1 + r 

to the line R\\T y \{b)\ = 0. 
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Section VIII. 

The formal expansion of an arbitrary vector. 

It was shown in the preceding section that both under regular 
conditions and under the type of irregular conditions discussed the 
characteristic values for system (40) are numerable and cluster about 
the point X = ®. Denoting these values by Xi, X 2 , X 3 ,. . . it is possible, 
therefore, to assign the subscripts in such manner that | X m | ^ | X mf! |. 
Moreover then lim | |X m | = «». 

ra-ce 

Assuming that system (46) is simply compatible at the characteristic 
values there exists for each of these values just one solution of the 
system in question and just one solution of its adjoint system. These 
solutions for X = X* will be designated by Y {k) (x)- and *Z w (x) respec¬ 
tively. 

Now if X = 0 is a characteristic value for system (46) let the para¬ 
meter be changed by setting X = X + r, c being a constant. Equation 
(46) then becomes 

rW- - I flWX + BWI Y(x)- t 

where B(x) - c R(x) + B(x). 

The characteristic values of the system thus modified jire X = X*—c 
and it is clearly always possible to choose c so that X = 0 is not a 
characteristic value. No loss of generality is entailed, therefore, by 
the assumption, which will be made, that X* 4 1 0 for any k. 

Writing the equation (46) for X = X* in the form 

y ( *>'(x). = B(x) Y' k \x)- +X*/?0r) Y {k) (*)• 

and considering this as a non-homogeneous equation we have from 
page 67 

6 

(73) Y {t \x) ■ =\,f G(x, t) R(t) Y w (t)-dt, 

a 

where G(x, t) is the Green’s function for the system 
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(74) 


y'(ar). = B{x) y(x). 

w a y(a>- + w b y(6)- = o . 13 


In precisely similar manner we have from the adjoint system 

b 

■Z w (x) = - X, f Z w (t ) R(t) H(z, t) dt, 

a 

or, substituting from the relation 

G(t, x) m - II(x, t). 


(75) 


b 

■Z w (t) = X, f -z m (x) R(x) G(x, t ) dx. 


Consider, now, the integral 

6 b 


•Z <0 (*) R(x) G(x, t) R(t) Y' k \t)-dx dt. 

a a 

In view of relation (73) we have 

6 

X.J = f -Z w (x)R{x) Y lk \x)-dx, 


while it is seen from (75) that 

b 

\,j= f •z (,, d) «co y < * , (t) d<. 

a 

By subtraction, then 

(X*- X,) J = 0, 

and it follows that J = 0 provided Ar 3= /. But then \ t J = 0 , i.e. 
6 


(76) 


f-Z in (*) fl(x) Y fk \x)-dx = 0, for fc * i. 


13 Since X = 0 is not a characteristic value of system (46) system (74) is 
incompatible and the Green’s function exists. 
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It can, moreover, be easily shown (see page 107) that when the 
system (46) is simply compatible at every characteristic value, this is 
not true for k=l, i.e. 

b 

J -Z (l) {x) R{x) Y il \x)-dx 4: 0, for any l 

a 

Let us suppose now that an arbitrarily chosen vector F(x)- can be 
developed into a series of the form 


(77) 


F(x)- = Z Ck Y ik \x)’ . 


*-i 


Multiplying both sides of this equation on the left by the vector 
•Z <0 (*) R(x) and integrating term by terra we have formally 
6 ^ 

[ -Z w (x) R{x) F(x)-dx = £<•* f -Z w {x) R{x) Y w (x)-dx, 

i *-■ • 

which in view of relation (76) reduces to 
b b 

(78) J-Z w (x) ftto F(x) dx - c, f-Z ( '\x ) R(x) Y w (x)-dx. 

a ° 

Inasmuch as the matrix on each side of this equation is one all of whose 
elements are identical the equation may equally well be written 


!/ 


2 zPtoTJkWAW 

A-i 


"1 


( 1 ) 


whence 


Cl 


•■{/i 

b 

fzzlP(x)y k (x)/ k (x)dx 

J A-l 

a __ 

6 

f Z*i n (x)y k (x)y[ n (x)dx 
J A-l 


2 4° to to viPto 


j(l)> 
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Consequently we have 


CD 


(79) *•(*)•- L 

k -1 


t> 

f 2 z[ k) (x) y h {x)f h (x) dx 
J A-l 
a 

b 

f 2 z{ k, (x) 7 *(i) yV\x) dx 

J A-l 




If, therefore, F(x) • may be developed into a series of form (74) which 
converges in such a manner as to legitimatize the processes above, 
then (79) is a necessary form for the series in question. 

Thus far it has been stipulated only that Y (k \x)- and •Z ( * ) (a:) be 
respectively solutions of system (46) and its adjoint for X “_X*. But 
each of these systems is homogeneous, and if y (A:) (a:)- and -Z (k) (x) are 
any particular solutions then cY (k \x )• and c^Z (k) (x) are also solutions. 
Having chosen a definite pair Y (k \x)- and -Z {k) (x) we have then 



If in particular c is chosen so that 

b 

I ;/2 «i«(*) 7*(x) dx = 1, 

and the vector c-Z {k) , for this value of c is associated with Y {k) * so that 
the choice of one implies the choice of the other, we have, on dropping 
the bars over the letters, 


(80) F(x) 


L 

A-1 


0 

/ 


2 z ( k k \x) y k (x)f h {x) dx 

k-i 


Y«\x) 


In using this formula it must be remembered that •Z (k \x) is deter¬ 
mined as soon as the particular y (4) (z)* is chosen. 
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In a precisely analogous manner it may be found that a development 
of the type 

■n r) - E e t # k \x), 

k -1 


which converges in such manner that it may be integrated term by 
term after being multiplied by any of the vectors R(x) F (,> (*)*, must 
necessarily coincide with the expansion 

f * 1 




CO 

E 

k -1 


/ 


2/»W 7* (*) !/»“(*) d* 

K -1 


■Z' k \x). 


Inasmuch as the matrix G(z, I) is not a vector the methods outlined 
above do not apply directly to the problem of expanding the Green’s 
function. We shall proceed, therefore, as follows. ^ 

I.et G,( x, 0 • denote the vector each of whose columns is the j col- 
umnof C(x, t). From the relation (80), FO) being replaced by G,(x,t) •, 
it follows that for any value of I, Gfe, !)• can be formally developed, 
the scries obtained being 


(81) <?,(*,<)• 


L 

k-l 


b 

f Zxi k \x) 7»(x) 9h> (x. 0 dx 



In view of equation (75) written in the form 

b t 

(*}°(0) - X, J 2 7»(*) J*/ (*. 0 dx. 


(81) reduces to 

G,(i, 0 • = E y {k \x): 

k-l ** 

from which we obtain the relation 

CO 

(82) ?./(*. 0 = E 

A-l 


X* 
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But since and Y ik) (x)- are vectors 

y'*’(*)• = ( Z y?\x) 4 k \t)j = n (s,S«( Z ) 4 W W). 


Hence we have from (81) 
(83) 


CO 


0(x, I) = L 

4-1 


y , ‘ , (i)-z < * , «) 


nXi 


Having obtained this formal development of the Green’s function 
it is possible to state the 

Theorem: If the development (83) converges to G(x, t) in such a 
manner that a uniformly convergent series is obtained by multiplying 
(83) on the right by any matrix of continuous functions and integrating 
term by term, then any vector F(x) • the elements of which are con¬ 
tinuous and have continuous first derivatives, and which satisfies the 
boundary conditions 


H’ a F(a)- + ir*F(6)- = 0 


is represented by a convergent development of the type (80). 
Proof: The relation 


F'(x)- = B(x) F(x )• + C(x)- 

defines the vector C(x)-, and inasmuch as F(x)‘ is then a solution of 
the non-homogeneous system 

}"(*)• = B(x) Y(x)- + C(x)- 
H'al'(a)- + H’^TO)- = 0, 


it is given by the formula 


o 

F(x)- = JG(x, t) C(t)-dt (see page 20). 


Substituting for G{x, t) its series development we have 


F(x) * = J ^- dt > 
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or, integrating term by term. 


(84) 


F(x)- = Z Y ll \x) 


*-1 


6 

4- f m-dt 

n\ k •' 


Now given any matrix of type D- t and another of type *A-, we have 
D”i4* = diaj = (nodi), i.e. 

(85) D-A- = naD- 


Hence (84) reduces to 


F(x)- 



b 

f 2 z[ k \t) c h (t) dt 
J H-l 


X* 


Y (k \x)- t 


the series on the right converging uniformly to /•’(*)• Q. E. D. 

In formula (12) the explicit expression for the Green’s function 
G(x, t) for an incompatible system of type (6) is given, Y h and Z h being 
respectively a solution of the differential matrix equation Y'(x) = 
A(x) Y(x), and of its adjoint. Now for any X, not a characteristic 
value, system (46) being incompatible is precisely of type (6). Conse¬ 
quently we have from (12) 


(86) <?(*,*,X) - (?(*,<,X) - Y(x,\)A-'(\)\WMa,t,\) +_ 

W b G(b, <,X), 

for any X not a characteristic value. 

Let us investigate now the functional dependence of G(x, t, X) upon 
the parameter X, choosing as the solutions Y(x t X) and Z(t, X) a pair 
which are analytic in X. Recalling that when the determinant of a 

matrix A is denoted by A then A~ x = where A a is the cofactor 

of the ijf h element of A, the explicit form of A _, (X) is seen to be 


A-»(X) = 



14 This notation holds when n = 1 provided it is agreed that when A is a 
matrix of one element then A» s 1. 
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Now it was seen (page 87) that at a characteristic value D(\) 
vanishes. Let us assume further that the characteristic values of 
system (46) are all simple so that D(X) vanishes at each of these values 
to only the first order. Due to the relation 

(87) I D *i I = I)n “ 1 ' 

which is familiar from the theory of determinants, 15 it follows, then, 
that Dn(\k) cannot vanish for all i and j, since in the alternative case 
the left-hand side of (87) would vanish to the order n, while the right- 
hand side vanishes only to the order (n — 1). Hence (D*<(X*)) 3= 0, 
and A"*(X) has a simple pole at each of the characteristic values. 
Moreover, system (46) is simply compatible at the characteristic 
values as was assumed above. It is easily verified (i), that the poles 
of A-*(X) actually persist in G(x, t, X), and (ii), that G(x, t , X) has no 
other singularities. 

Denoting by G {k) ( x , t) the residue of G{x, t, X) at X = X* we obtain 
from (86) by familiar methods 

,.. , Y«\x) (D„(X*)) \W.Y' k \a) - W>Y ik \b)\Z k ® 

G (a-, t) - - 1 t » 

which, inasmuch as ^ D(X)j x _^ i}: 0, is of the form 


( 88 ) 


C* M (x, 0 = Y (k \x) C'*' Z w (t). 


From this it follows that for fixed t, G {k) (x, I ) is a solution of the system 


(89) (y'(x) = |fl(x)x*+B(*)) r(*) 

(H'.y(«) + H'»F(6) = 0 . 

But if Y(x) is a particular solution of the differential equation of this 
system every solution is of the form 

Y(x) = Y(x) C. 

Substituting in the boundary conditions as on page 64 and denoting 
by (da) the A which results from Y = Y we have 

15 Cf. Bdcher, loc. cit., p. 33. 
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(90) (<*</) (c«) = 0 , 

n 

or 2 </ i4 Cy = 0, i,j, = 1,2,.. .n. 

/-i 

Now it was shown that (Z)y(X*)) 4= 0, ••<*. that the rank of Z>(X fc ) is 
(n — 1). It follows that the solution of the system of linear equations 

(91) Zrf,iC| = 0 i- 1,2,...n 

/-i 

is unique, namely that the Cy of system (90) is the c, of system (91) 
for all j. ^Consequently the solution of system (89) is unique and of 
the type Y(x)C •. One solution of (89) is known, however, namely 
Y {k \x)'. Inasmuch as G’ (4, (x, 0 was also seen to be a solution in x it 
follows that each column of G {k) (x, t) must be the same as the general 
column of Y {k) (x)- except possibly for a factor independent of x. 
Accordingly g\ k) (. r, t) = cj 4, (0 y[ 4> 0r), 

from which C ( *’(x,0= - y <W (*)-•C < * , («). 

n 

Hut from (88) it is also seen that G lk) (x, t) is, for fixed x, a solution 
in t of the system adjoint to (89). Since this solution is again unique 
and -Z lk) (t) is a solution we have 

g\? Or. 0 = d w <*> ci k \t) = c!*>(x) zj 4, «), 

i.e. cj*>(0 = c< 4 > z< 4) «). 

Hence (x, t) = c (4) y\ k \x) z} 4, «) 

or 

r <*> 

(92) & k \x, t) = — y (A, (*).-z (4) (o, 

n 

the value of c (4) being as yet undetermined. 

Writing equation (46) in the form 

F w> '(*)- = {R(x)\ + B(x)\ Y w {x)- + |Xi— XI R(x) Y ik \x)-, 
and considering this as a non-homogeneous equation, we have 
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b 

y<*>(x)• = - |x — Xi| Jg(x, l, X) R(t) Y fk) (t)-dt. 

a 

Inasmuch as 

lim |X - X t ) G(x, t, X) = G (k \x, t), 


this yields 





G ik \x,t) R(t) Y ik \t)-dt, 


and substituting from (92) we see that 


J * 

— y (W (x)'-Z li \t)R(t) Y fk> (t)-dt, 

a 71 

But in view of the relation (85), 

6 

Y ik) (x)-f-Z ,k, (l) R(t) Y w (t)-dt = 


n 


b 

f z xf'd) v*W vPWdt 
J h-l 


Y m M- 


It follows that 


b 

= - c ( *> f 2 zi k \t) 71,(0 yi k \t) dt, 

J A-I 


or upon associating with Y {k) (x )• the proper •Z ( * ) (x) (see page 101) 
that c {k) = —1. 

Hence 

(93) G w (x,t) = - - Y {k \x)--Z {k \t). 

n 

This result enables us to deduce an expression for the sura of m 
terms of the series development of an arbitrary vector F(x )• Thus 
we have from (77) and (78), (*Z*and Y l • being associated in the 
manner stated) 
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b 

J-Z m (x) R(x) F(x)dx = c,(l). 


Multiplying this equation by y ( 0 (x)* on the left it becomes 

b 

e| y®W-(l) = / K <n (i)- •Z <,> (l) R(t) F(t)-dt, 


which in view of (85) and (89) reduces to 

6 

(94) c, Y w (x) - fc w (z, 0 ltd) F(t) ■ dt. 

a 

This expression for the I th term of the formal development was de¬ 
duced upon the hypothesis that X| is a simple characteristic value. 
If at X| a number of characteristic values X/„ Xj,,.. .X/„, coincide, we 
shall define the term of the formal series which corresponds to this 
value of X to be 

b 

- /c< o (x,0fi(0 Fd)-dt. 


In every case then we have a formal series which is completely de¬ 
termined and of which the l ,h term is given by formula (94). It is to 
be noted that G ,(0 (x, 0 for the case in which X, is not a simple charac¬ 
teristic value is not given by the left-hand side of (94). It must be 
computed for every such case individually. 

Now let pi be any contour in the X plane which surrounds X/ but no 
other characteristic values. Then by use of the relation 


jG(x t X) dX - G* 0 (x, 0 
we have from (94) ^ 

C| y«>( x) . = - JL J J G(x, I, X) rfX R(t) F(t) dt, 

^ 9 *I 

or, upon interchanging the order of integration and choosing a contour 
Cm which encloses the characteristic values Xi, X*,.. .X m and no others, 


(95) 


2> y (fc) Gr). = - 
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Section IX. 

The Convergence of the expansion. 

From the distribution of characteristic values as found in section 
VII, for both the cases of regular and irregular conditions there dis¬ 
cussed, it becomes readily apparent that given any constant N suffi¬ 
ciently large it is always possible to choose a circle C whose center 
is at X = 0, and whose radius both exceeds the value N and is such that 
the distance from any point of C to a characteristic value is greater 
than some fixed constant 6 > 0. 

We shall consider the convergence of the contour integral in formula 
(95) as the contour of integration is taken successively as a larger and 
larger circle of the type C above. With the size of the circle the num¬ 
ber of characteristic values which it includes and hence the number of 
terms of the series which are summed by the integration may be in¬ 
creased indefinitely, the limit of the integral for | X | = being, if it 
exists, the sum of the corresponding series. 

Let us recall the hypotheses already made concerning the functions 
y t (x). We have 


(i) 

7<(x) 4= 0 ) 


(ii) 

7i(x) continuous a ^ x ^ 6 V 

(page 72) 

(iii) 

if n ^ 2 7 <0*0 4= 7/0*0 for t 4= j I 


(iv) 

if n £ 2 arg(7<(x) — 7/0*0) = h <i 

(page 83) 


To these we shall add 

(97) arg 7 <(x) = //< (a constant), i = 1, 2,.. .n. 

Concerning the vector to be expanded we shall assume 

that the elements /<(x), t = 1, 2,.. .n, consist in the interval 

(98) a ^ x ^ b, of only a finite number of pieces each of which is 
real, continuous, and has a continuous derivative. 

It will readily be seen from (96) and (97) that we have restricted 
each 7 ,(x) to vary along a ray in the complex plane. Moreover, if 
n ^ 2 the dependence of 7 /(x) upon x must be such that the slope of 
the line joining any two of these points remains constant. This 
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means that every n sided polygon with vertices at the points 7 /(xo) f 
j =1,2 . .n, a ^ xo^ 6 ,'at most expands or contracts about X = 0 
as x 0 is allowed to vary. 

If the lines along which 7 /(x), j = 1, 2,.. n, vary are all distinct, 
the conditions of the system are regular provided W a and W b are 
suitably chosen. If, however, one or more sides of any of the polygons 
mentioned lie on a line through the point X = 0, we have the irregular 
case discussed in section VII. In particular all the sides may lie on 
such a line, (for instance the quantities y>(x) may be all real) and it is 
upon this configuration that the irregular case in question bases its 
chief claim for interest. Since the condition (96 iv) is automatically 
fulfilled in this case the functional dependence of 7 >(x) upon x is far 
less restricted than when the points yfat) form the vertices of actual 
polygons. 

Substituting in formula (86) the value Q(z, t) = = Y(x) Z(t) we 
have 

G’(x, t, X) = Y(x) i * J / + j A-'\li' a Y(a) - H\Y(b)\\Z(t), 

the upper sign holding for t < x and the lower sign for i > x. It is 
desirable in the following work to express 6 ’(x, t, X) in a somewhat 
different form. 

Upon setting 

+ § / +} A-* | ir.y(fl) - ir*y(&)) - («*,) + u, 

multiplication by A yields 

i H'oYia) + J H'*y(6) + } Il c Y(a) - i H\Y(b) = 

|ir a y(a) + ir 6 y(&)| (8»)+au, 

which, in view of the relation (6*j) + (6~) = /, reduces to 

W.Y(a) (6~) ~ II b Y(b) («•) = A U. 

Hence 

U = a-' H. >'(<<) (O - A - 1 («*,•)• 


Now setting 

- \ I + \ A-*|H’ a y(a) - W>Y{b)\ = - (6~) + V, 
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it is found by precisely the same method that V = U. In consequence 
we have for the Green’s function 

f ( «•) ) 

(99) G(x, t,\) = y(x) | | or^ j- + A- 1 II a Y(a) (6,>) 

- A-' H't,y(6)(S?j) | Z(t), 

where the upper form is to be chosen when t < x, and the lower one 
for t > x. 

By means of this formula G*(.r, t, X) may be explicitly represented by 
choosing as the solutions )'(*) and Z(t) a pair which are analytic in X 
and have the forms 

y(*) = (c xr > w * B t x) j a« + |) 

z(t) = + 

obtained in sections V and VI. It should be observed that G(x, t, X) 
is unique (see page 70), despite the fact that if n ^ 2 the choice of 
y(ar) and Z(t) thus determined upon changes from any one to any 
other of the sectors within which no quantities RMyM — y>(.t)|| 
change sign, and despite the fact that the values of 6 i; and 6, ; change 
from any one to any other sector 

From (95) we have, upon denoting by S m (x) • the sum of those terms 
of the formal development of F(x) • which correspond to the charac¬ 
teristic values enclosed by the circle C in question, 

b 

(100) S m (x)- G(x, t, X) It(t) Fd)-dt rfx. 

C a 

Substituting in this the value of G(x, t, X) as found above in (99) it is 

4 

seen that S m (ar)- = £ S„(x)- t where the quantities are given 

i-1 

by the relations 
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( 101 ) 


«S?W- 


X 

f f Y(x)(i‘j)Z(t)R(t)F{t)-dt d\ 
c 2xi J u 


C" 


0 

S"\x)-= L^-.f f YW(S~)7.(t)mF(t) dtd\ 

C 2xi ** *' 


Cm r 


0 

S2?(*) - Z —. / r(x)A-' f W.Y(a)(i")Z{t)imF(t) dtd\ 

C 2 7TI 


C^. 


o 

s ( £M =L^-.C Y(x)A-' fir>Y(b)(s;,)Z(t)mt)F(t) did\, 

c 2x1 J Y 


C M , denoting uny arc of circle C which lies within the sector and, if 
n ^ 2, upon which none of the quantities R |X|-y,(x) — 7/WI change 
sign. If C ^ abuts upon a ray bounding <r„, it shall either include or 
exclude the end point for which the quantity R | Xr„(6) | or the quantity 
R |XI\(6)| vanishes according as the quantity in question is < 0 or 
> 0 within <r, M . The arc C„ r may, therefore, include one, both, or 
neither end point, and since the reasoning is precisely the same in euch 
case (the sector can be split if both end points arc included) we shall 
consider for the sake of concreteness that it includes one, namely that 
one which lies on the ray R |Xr r (6)J = 0. The symbol 2 indicates 

that the sum of the integrals over all arcs composing the circle C is to 
be taken. 

We shall proceed to evaluate each of the integrals above in turn 
and in the course of this evaluation it will be convenient to refer to the 
facts established by the following lemmas. The notation | tp(x) |<3/ 
will be used to indicate that the function in question is bounded. It 
is not to be understood that the 3/ in any one case represents the same 
constant as in any other, but merely that there exists in each case 
some constant for which the relation is true. 


Levima 1: Given any function v>i(x, X) which is such that 


(102) 


(i) | vAx , X) | < 3/ fora £ * g 0, | X | > N, 

(ii) lim | <pi(x, X) | = 0 uniformly, for a g x ^ 0 — x. 
x-® 
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where x is an arbitrarily small positive constant, then 

0 

lim / *pi(r, X) dx = 0 . 

|x|-® u 

a 

Proof: Under the hypotheses (102) we have 


lim 

|X|*cn 


0~X 


X) fix = 0 uniformly, 


and 


p 

| f ip x (x,\) dx\ < M X , 


0-X 


while it follows from these relations and the relation 

| J *>»(*, X) dx | ^ | f v i(ar, X) dx | -f | J v>i(*, X) dx |, 

« « t-x 


that 


p 

j* X) 


dx | < 2 M x 


for | X | sufficiently large. Since x is arbitrary, however, this means 
that 

o 

lim / <pi(x, X) dx = 0. Q. E. D. 

IXl-ool " 

Given any function v> 2 (X) which is such that 

| <p 2 (X) | < 3/ for 0 a £ arg X ^ e fi , | X | > N 

lim | v> 2 (X) | = 0 uniformly, for0 a ^ arg X ^ Op - 
lx|-« 

where 0 X is a constant arbitrarily small but positive, then 

V O0 

C.„ being that arc of the circle | X | = p which lies in the sector bounded 
by arg X = 6 a and arg X = 6$. 


Lemma 2: 
(103) 


(i) 

- 

00 
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Proof: Writing X = pe %i we have 

V>*(X) = wipe 10 ) = faiO, p). 

Hence 




lim — J <p 2 (X) — = lim — J +*{8, p) dO, 

ixl— 2 «cL, X 2ir t 

and since the limit of the integral on the right is zero by lemma 1, 


lim — J **(X)“—0. 

W-® 2wl c a0 X 

I.emvia 3: Given any function y>j(x, X) which is such that 


Q. E. D. 


(104) 


(i) | <pa(x, X) | < M fora g x g ft 0 a g urgX g Off, | X | > N, 

(ii) lim ^j(x.X)|= 0 uniformly fora ^x^0—x. 

|X|-co 

0 a ^argX £ 0ff-0 x , 


then /(X) = J ^ 3 (x, X) dx is a function of the type v> 2 (X) of lemma 2. 

a 

Proof: When X is confined to the sector 0 a ^ arg X ^ 6 0 — 9 X , 

lim /(X) = 0 uniformly by lemma 1, while for X in the larger 
N-® 

sector 6 a ^ arg X £ 0 S 


/(X) 


o 

| S f Mdx = M(fi - o) . 


Hence /(X) is of type <p 2 (X) by definition. Q. E. D. 

As a matter of convenience we shall use hereafter the symbols 
<p u >p 2 , and v>3, to designate any function of X and other arguments 
which satisfies conditions similar to (102), (103) or (104) respectively. 
As heretofore functions which approach the limit zero uniformly as 
| X | increases indefinitely will l>e denoted by «. I^et us return now to 
the direct evaluation of S m (x)-. 
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I. &£(.*)■■ 

Writing 

* 

Ji = - f Y(r) (««) 2«) Jf(!) 

a 

we have from (98) 

«??«• = ~fj l d\. 

Ixi c 

From this it is immediately seen that 
(105) S"\u)=0. 

Consequently it is necessary to consider further only the case x =£ a. 
We have 


0 -xr ai «>-* a C!> 


y,.= - ( f L eW+ B *''{6, k +l/\Mz)]\s:, 

\ j k.m.p.q -1 v 

U mp + i/xi* mp (0]i* P ,7,(0 /,«) <«)), 

i.e. 

X 

( 106 ) •/.-=-(/ c x I r ( <I » _r i <,> I * *j <x >- B i ,, > + 

X 

l/X [f i x c M^)-r*(i>hB 4 U)-s 4 (i> ^ 7p(0/p(0l6ftp **(*) + 

0] -*). 

For X on any arc C^ ( however, each term under the sign of summation 
in the last matrix on the right of this expression is seen to be of the 
type X). Their sum therefore is of the same type, and by lemma 
2 the integral is of type y>»(X). Consequently we have, upon integrat¬ 
ing by parts the elements in the first matrix on the right of (106), 15 

Jv = «*.■ j -JA.X - 0) + c xr ‘ l ' , ' 8 ‘ < ' ) /,(a + 0) + 

X 

f ^Mt)c B -' z) - B <"' j*J ) + ^ 

15 Recall that I\(a) = 22, (a) = 0. 
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Accordingly it is seen that Jr can be considered as a sum of ma¬ 
trices of which the first three are discussed as follows. 

For the first matrix we have directly 

l - («*/.(* - 0)) = J («;,) F{X - 0)-. 

The second matrix 

- (&*****-**!&)) 

A 

is one each of whose elements is seen either to vanish or to be of type 
on arc C„. Hence the entire matrix may be represented 
by 

The third matrix 

X 

- 'flu f e x l -j- {/,-(<) \dt 

X\ ” at 

is likewise one each of whose elements either vanishes or is, on arc 
C„„ of type ^*/X by lemma 3. This matrix is, therefore, also of the 
type M/K and inasmuch as a sum of matrices of type (*> 2 ) is again 
a matrix (*> 2 ) we have 

J, = \ I (So) F(x - 0) • + (*.) | 

A 

The integration of Jr over the arc yields, therefore, by lemma 2 
<*£*> F(, - 0)-+ m j 7 = u f v tfSr*) - «•+ w. 

where is the angle subtended by at X = 0. 

A similar expression results from integration over each arc similar 
in character to C„,. Hence we have 

«!?(*)— L ^ (&) Fix - 0).+ («). 
c 2 it 

Let us consider in detail the sum 

(107) I 3c («£•’). 

c 2x 
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Corresponding to each arc C^ there exists another arc C*; which is the 
reflection of C^ in the point X = 0. Since also subtends the angle 
at the point X = 0 we can write the sum (107) equally well in the 

form 


(108) 


E 2= («!f' + «p). 

1 n 9 -w ; 


the summation covering now the arc of only one (any) half of circle C. 

But it will readily be seen that Sfj*) = tiff*, for quantities which have 
a positive real part in have naturally a negative real part in tr-;. 

Hence tiff* + 5 < gT ) = 5,/, and the sum (103) i.e. (107) reduces to 


L 

\c 2 IT 


i /. 


It follows, therefore, that 

(109) l <«5f*) = i L 

c Z* 

In consequence we have 


(110) limsffw-- JF(*-0). 

m- co 


//■ sfffr) -- 

The treatment of this expression is parallel to that of S ( J,’(x)* and is 
as follows. 

Writing 

= f Y(x) (S';>Z (t) R(l) F(t)-dt, 

* 

«!?(*>•- 

Hence 

(111) «S?(«— 0, 


we have from (101) 
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and it is necessary to consider further only the case x 4= b. We have 


Jr = 


= ( f I U*+ 

\? k.m.p.Q -1 


5., + 


•i*W 1 I r** 

“x - )*" 

^j*P.T,(0 /.«<«) 

-( f I [«*,*<*(*) 

xW *.p-i 

+ « it ^ P (0]d<). 


Each integrand in the last matrix of this equation is of the type 
V>i(*,X) and accordingly the matrix is, by lemma 1 of type («)/X. 
Integrating by parts the elements of the first matrix on the right we 
have 


Jt . = I ^ 4 “ j _ e k l J,(b - 0) + Mx + 0) + 

JV(r i u>-r j «>| i | /((t) „»^>-»«»«) A j) + Y 

* 

Therefore, Jr can also be expressed as a sum of matrices, the first 
and third of which, namely 


and 


I „M PiM-iyM !♦*((*>- B > {b) f i (b'^J 
I J* f M riM-W I ± | /,(,) | _ 


are of the tvpe («)/X on arc C„.. for each element either vanishes or 
approaches ‘the limit zero as | X | = The second matrix of the 
sum is directly 
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(!“/.(*+0)) = l ~ (S~)F(x + 0)-. 

A A 

Hence 

Jr= \ l(««) #■(* + <))• + («)), 

A 

which integrated over gives 

J- /{ «!?’) F(x + 0)•.+ (.)} J = ^ (J'p 1 ) F(x + 0) • + («). 


By familiar reasoning this leads to the equation 

«»(*)• = E ^ (4'p) F(* + 0). + («)» 

C 2t 


and inasmuch as 


(112) 

L 2* (s <?>) = i i, 

c 2t 

as may be shown by applying again the argument by means of which 
relation (109) was established, we have 

i.e. 

S*Or).-J/(* + 0) ■+(.), 

(113) 

lim S!i'(x)-= J F(x + 0) • 

m- co 

HI- S ',«(*)-. 


Following the procedure used in the discussion of the preceding 
expressions, let ^3* be defined by the equation 

b 

J 3 ‘ = — 

Y(x) (O 2(<) fl«) f(<W< 

a 

Then 

Now 

affW— 

A -1 — 

v Z)(X) A 

16 See note on page 104. 
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Z)(X) its value as given by equation (56) we have 

1 

*-I 

*£» _ 

l r( xr *'* , **‘ ,A >l D(X) 

*-l 

-| xrI i£> , 

ZHX) 

where D(X) is given for the case of regular conditions by formula (57) 
and for the case of irregular conditions discussed in section VII by 
either formula (66) or formula (68). A glance at these formulas 
shows, however, that in each case the expression for D(X) reduces, 
under the conditions for which the expression is valid to the form 

D(X) = H' 1 "' + *>,, on arc C„. 

Now every point of a circle C, and hence in particular of an arc C„ 
is at a distance which exceeds a fixed quantity from any characteristic 
value. However, by analogy with formula (59), for_any point of C„ 
under regular conditions for which the function D is sufficiently 
small we have 

X = —) j - B,(b) + log+ « + 2 P«i\- 

In other words the point in question will necessarily lie near one 
of the characteristic values. 

This stands in contradiction with the fundamental property of the 
circle C. 

Inasmuch as a similar relation holds on every arc of type C^, 
it follows that for every point of all circles C, Z)(X) exceeds, under 
regular conditions, some fixed positive constant 61 , i.e. 

(114) |ZXX)|^5i>0 for X on any circle C. 

While we have thus proved the relation (114) only for the case of regu¬ 
lar conditions it can be shown to hold equally well under the type of 


and substituting for 

A-> = 
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irregular conditions considered in section VII. 17 In either case, there¬ 
fore, A -1 is uniformly bounded on all circles of type C. 


Proceeding, we have 


F0r)A-*= £ e xr A (*H« 4 (,) j a + I 

*-1 ( x ,) Z>(X) 


e x\ r 4 (x)-i \(b) s,i |♦* i (x>-* l .(6) 5 


r\^)) 
( DM J 


, £ ^x|r 4 (x)-r 4 (6)aIi|4R 4 (x)-B 4 (6)«JJ k*MI #/*(X) 

^ *-i X Z>(\) ‘ 

Since the last matrix of this expression is of type («) we have 
Y(x) A->= ^ e x l r *<*)- r <<«*i7|^.(*>-a,(6)C | + € | y 


Y(x) A-' = 
or, more briefly, 


(115) K(x)A-'- (e x I W-WHi 1 ♦»,(*)-*,;<»)»« r .. (x) ) i 

where | r w (X) | < 3/ for | X | > N. 

Inasmuch as DM was seen to be of the form 

DM = H’<'■'> + v>2 

for X on arc C„„ we have for any X on this arc 

or, denoting by the matrix whose determinant is 
016) (r.y) = G^> + (**). 


+ ». ■ \ V 

+ ** + V \H /(Mr V + ( * 2) ' 


17 Cf. Wilder, loc. dt. p. 422. 
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We have, further, 


o 

f W. Y(a) (O Z(l) m F(t)-dt = 

O 

f i j i o- xp - , °-'- <0 

J A.*.m.p .«-1 ( X ) t 




E tols’C* f 7»w aw <*) 

A-l 


L W$s r* f 7,(0 / p (0 (0 + «*„ *&’) A 

A.A.p-1 J 


L 

A-l 


_r kr »>*>- a »'»'/, (6 - o) +/»(«+ o) + 






the last form representing as heretofore the result of an integration 
by parts. The multiplication of this matrix by the matrix (115) 
yields the result 


- 1 
J ’ = T 


E e x l r ^>- r «“‘Cl^* u, - fl ^ )5 ur < *(X) uftbZ[ 

k. A-l 


6 

j A(a+ 0)-r kr ‘ ,6, - B ‘ ,6, /*(i-0) + / e- xr * ,n j t IA(0«‘ s * < ")A 
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the principal matrix on the right again breaking up into three compo¬ 
nent matrices. Of these the first, 

x \k,h-i e ' 

and the third, 



£ e x \ r i (*)-r < (fc)aJJ | 
k.h-i 


b 


are for all values of a: clearly of the type («)/X. Only the second 
matrix of the sum, namely 


r it (X)4Wj,(a+0)J , 


remains, therefore, to be considered. It depends for its character 
upon the value of .r. 

For x 4= a, x 4= b, the matrix is clearly of type (<*> 2 )/X. For x = a, 
on the other hand, the exponential factor common to the elements of 

the i th row reduces to |*tf — f so t h at 

j 3 (a)-= — f £ 8a r ik (\) w$ &Hkfk(a + 0) + <\ . 

The fact that 

Ji(b)' = f biiT ik (\) ip** S kh f h (a -|- 0) -f <Pi) , 

A 'k.K -1 / 

may be established in similar manner by use of the relation 

Substituting for (r <; ) its value as given by formula (115) it is readily 
seen, therefore, that on arc C p , 
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j, ( o) • = — (aip) w. (ajp)«« + °) • + WI 
x 

j,(b) • = — 1 (dh n ri H'.wir')«« + 0) •+(*>.) I ■ 

X 


It follows from this that 

= £ — /*(«)— when i + a, i + b. 

c 2ri c.. X 




£ jL/ i—( a;r>) n , .(iSr , )f'(a+°)+(»>.)i^ 


c 2ti c - 


when x = a 


£ —./1 — («ir’)n , "' > «'•«£’> f(«+0)4Wl^ 

C 2irl ^r 


when x = 6, 


or, upon defining the matrices Ki and K» by the equations 


(117) 


K,~z\-T ^ W ' ( & ] 

c * w 

K,= L j - ^ («£’) «'• (*£’> \ ■ 


(118) lim Si'(*) • 

rn-co 


= 0 when x t fl, i + ^ 

= Ka F(a + 0)- when x = a. 

= KaF(a+0)- when x = 6. 


It should be observed that the values of Ki and K a depend only 
upon the differential system whose characteristic functions form the 
terms of the expansion in question and are in particular entirely inde¬ 
pendent of the vector F(x)- 


IV- SffM- 

Proceeding precisely as in the case of SS?(*)- we have upon defining 
Ji • by the relation 


420 



BOUNDARY PROBLEMS AND DEVELOPMENTS. 
b 

J t - = Y(x) A-'fw„Y(b) (S‘j) Z(t) R(t) F(t)■ dt, 

a 

dx. 


Further 

t 


f W h Y(b) («’ ) Z(t) R(t) F(t) ■ dt = 

a 

b 

fe yMM ) dt 

h ~ l 

b 

l h L «,<*> tlje'l r *<‘)-r*«,|♦»*<»-»»«> w/p(0 [aw 

A h.k.p -1 


4- 6* p ^>a*(&)] df 


= 7 L »8P j - A(6 - 0) + e x WW>f h (a + 0) + 

A h-l ^ 

6 

f e M r *<‘>-r*«)) ± (,■*»>-»*<•>/,(,) ) *) + i<2> 

a dt ) X 

and multiplying this matrix by the matrix K(x) A-‘, as given by 
formula (115) it follows that 

•'*•=£ i#ww" i +**-«>»- Tit ( x ) w % S ' kK _ A(6 _o)+ 

6 

/r A (6 , tflA(6 , /A(a+0)+ f e x | r A ,6,-r 4 („, | + <£>. 
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The second matrix of the sum on the right, namely 

A / 


is seen to be of the type (w)/\ whereas the third, i.e. 


1 

X 


n 

k.h-i 


6 

T(*(X)«riV i»A f' M r *“>- r »«‘»l I /»(/) dt 


is also of this type, as is apparent from the fact that by lemma 3, each 
of the integrals with respect to t which actually occurs in any element 
of the matrix is of type •fit, while the remaining factors are bounded. 

The remaining matrix of importance in the expression for J\- t 
namely 



A.A-l 


again depends for its character upon the value of x, being of the type 
(v» 2 )/X when and only when x 4= a and x^b. By reasoning now 
familiar the results 


Ji(a) • = — r ik ( X) W l kh thhfhib - 0) + <p^J 

= ^ (O V- 1 ^6 («* ) F(b — ” W, 

id 

M b ) • = — ( E C T,-*(X) 6 AA /* (6 - 0) + y> 2 ) 
x 'A.A-l / 

= ^ (5**) W'* ($:,.) F(6 - 0). + l - (<p 2 ), 
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may be deduced, whereupon it follows that 


s£?(*) 


= £ 7T~- ^ (**) "T when x ^ a » * =♦= b > 

c 2m X 

= w\(ag 


* 2iri 






C 27Tic 


(«Sr*) r(6-o)-+(»>.)) ^ 

A 

when x = a. 

(«Sr ,, )n < *' ) ir»(«5f>) f(6 -o)-+ Ml ^ 


-I 


***’ 


when x = b. 


Defining the matrices K\ and K\ by the equations 

o^' 1 r, («‘r>) 



k*-l i 

- 

(119) 

c ( 

2 T 


S 4 =Zj 

- W** 


C ( 

2r- 

it follows, therefore, that 



(120) lim S ( *’(x) 

m-co 


= 0 when x ^ a, x ^ b 

= Ka F(b — 0) • when x = a 
= Ki F(b — 0) • when x = b. 


A summary of the various results as contained in formulas (110), 
(113), (118) and (120) is seen, in virtue of (105) and (111), to yield 


lim S m (x) • = $ b\x — 0) • -f h F(x + 0) • when x a, x 4= b, 

m-co 

lim S m (a)• = \ F(a + 0)• + K,F(a + 0 )-+K t F(b -0)•, 

m-co 

lim S„(6)• = * F(b - 0)• + K 3 F(a + 0 )■ + K* F(b- 0) •, 


m— co 


In consequence we have the following 

Theorem: Given that L is any vector differential system of the 
type 
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y'(x)-= [A(x)\ + B(x)\Y(z)-, 

Wo Y(a)-+w b Y(b)- = 0, 

which can be reduced by a change of the dependent variable to a 
system of type (46) for which (a), R(x) and B(x ) are continuous to¬ 
gether with their first derivatives (6), the functions 7 /( 7 ), j = ), 2, ... n, 
satisfy the relations (96) and (97), and (c), the condition (ii) on page 
89 is fulfilled. Then the development in characteristic functions of the 
system L which is associated with any vector F(x)‘ whose elements 
satisfy condition (98), converges to 

\ F{x - 0)- + i F{x + 0)* when z+o,i+6, 

II a F(a -f 0) • + J a F(b - 0) • when x = a, 

H b F(a 4- 0)- 4- Jt>F(b - 0)- when x = 6, 

the four matrices of constants H a , J a ,_Hb, anc \_J b being explicitly 
determined by the matrices R(x), A(x), W a and W b as stated. 
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(Vorgclegt von Urn. Bikiiirmach.) 



Bei (ler Untersuchung gewOhnlichcr Differentialgleichungen 


dt 




(i = i, • • • n) 


spiclen in (ler Nahe eines Punktes, z. B x, = • • • = x m = o, wo alle die reellcn 
analytischen Funktionen AT, vcrschwinden, gewisse Reihcn einc wichtige Rolle. 
Wenn diese Reihcn konvcrgieren, so ist es nicht schwcr. angcmessenc Fragen 
flber die Nntur der Lfaungen zu bchnndeln. 

Wenn abcr diese Reilien divergieren, so scheint die Sache ganz anders 
zu liegen. Es ist die Absicht dieser Arbeit, im einfachsten Falle cine Ver- 
mutung dber die Bedeutung dieser divergenten Reihcn zu begriinden. Vor 
der Behandlung dieses Falles soli die Vermutung selbst aufgcstcllt werden. 

Einem solchen singularen Punkt mbgen forxnale (n — A)-dimcnsionale 
Fliichen S t zugeordnct sein (£<n). Diese seien (lurch Gleichungen 

F, = o • • • F k = o 

gegeben, wo die F, • • • F t (konvergente oder divergente) Potenzreihen in x, • • -.r„ 
ohne Absolutglieder sein mflgen. Als Folge der Gleichungen F, = o • • • F k as o 
und der Differentialgleichungen m6ge 

dFi = o 

formal richtig sein. Offenbar ist der Schnitt zweier solcher invarianter for- 
inaler Flachen selbst eine invariante formale Flache. 

Weiter rndgen formale, (lurch Potenzreihen dargestelltc Intcgralc 

fl, = const. 

existicren, und zwar entweder auf einer solchen formalen Flache S t oder im 
x, • • • x.-Rnum (k = o). Es kann der Fall eintreten, daB ein einer positiv 
definiten Porm auf S k entspriclit. In diesem Falle zcigen offenbar gewisse 
Losungen der Differentialgleichungen (nSmlich die im ublichen Sinne S t be¬ 
ll*) 
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nachbartcn) Stabilitfit, derart, d&fi x,--x„ fur cin langes Zeitintervall in der 
Nfthe von x, = o • • • = o bleiben, und zwar sowohl fQr wachsende wie 

fur abnehmende Zeiten, ohne Rucksicht auf die Konvergenz oder die Diver- 
genz der vorkommenden Reihcn. Wegen dieser Eigenschafl wollen wir dies 
den stabilen oder halbstabilen Fall nennen, je nachdexn die Dimension 
eines oder des anderen der S k gleicli n ist, oder stets kleiner als n ist. 
Allc anderen Falle sollen vOllig instabil heifien. Der stabile oder hnlbstabile 
Fall kann nur dann eintreten, wenn die Koeffizienten in den Kntwicklungen 
von A' • • ■ X m nach x, • • • x, unendlich vielen Bedingungen genflgen. 

Die erwfthnte Vermutung ist nun die, daO im v6llig instabilen Fall die 
divergenten Reihen durcli Funktionen eines einfachen Typus ersetzt werden 
kiinnen, welche im reellen Gebiet dieselbe Rolle spielcn wie die im Falle 
der Konvergenz durch die Reihen dargestellten Funktionen. 

Wir wollen diese Vermutung im einfachsten Falle n = 2 bestAtigen. 
Hier sind die Wurzeln m ,, m, der in m quadratischen Gleichung 


(0 


3A. dX, 

dX, 3 A, 


wo in den partiellcn Ableilungen x, = x, = O cinzutragcn sind, von grund- 
legcndcr Bedeutung 1 . Wenn m,, rn, reell und von einerlci Vorzeichcn sind, 
doch so, daO keinc der beiden Zahlen cin ganzzaliliges Multiplum der anderen 
ist, so konvergieren die auftretenden formalen Reihen <p, yj/, und die allgcmeinc 
I.Osung kann in der Form 

* = 4 = 


geschrieben werden. Wenn wir dann die so gerechtfcrtigte annlytische 
Variablentransformation 2, = <p, 2, = \l/ machen, so nclimen die Diflcrential- 
glciehungen die cinfache Normalform 




an. Daher ist die AuflOsung in der Tat auf diesen einfachen Fall reduziert. 
Ahnlich herrscht Konvergenz, auch wenn m ,, m, konjugiert komplex, aber 
nicht rein imaginar sind. Fine analoge Normalfonn spielt dann eine wich- 
tige Rolle. 

Es bleibt dann der Fall ubrig, wo m ,, m, reell und von verschiedenem 
Vorzeichen sind. Hier kOnnen die Reihen if>, yl divergieren. Auf diesen 
Fall wollen wir nun unsere Aufmerksamkeit richten, und wollen dazu die An- 

nahme machen, dafl — irrational ist. 

_ m . 

1 Zu den erwahnten Tatsachcn vgL man L. Bisbfpbacii, Theorie der Diflerentialgleichungen, 
Berlin 1926, Kap. 3. 4. 
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5 


Zuerst wollen wir das folgende Lemma beweisen: 

Lemma. Die formale Potenzreihe, mit reellen Koeffizienten 


s(x ,, xj = 2 




m!n! 


*r*.\ 


mOge konvergente Zeiien und Rolonnen haben, so dafi sie in jeder 
der beiden Formen 

<-(*■)*■" Z- *.(*.)*: 

m\ n! 

■=• ■ = o 


geschrieben werden kann, wo»,(i,), f„(x,) in den komplexen Vnria- 
blen x,, x, analytisch sind fur |x,|<r, |x,|<r, r>o. Dann gibt 
es eine reelle Funktion f(x ,, x,) der reellen Variablen x ,, x,, die 
mit alien partiellen Ableitungen fdr |x,|<r, |x,|<r stetig und fiir 
x, 4= °» x » =£ ° analytisch ist, und fur die 


ist. 



= cw, 



Wir werden eine solche Funktion in der Form 


(>n, n = o. I, a •••) 


explizite hinschreiben 1 . 

Zunfichst betrachten wir diese Funktion in dem vierdimensionalen Berelch 


*. <r 


x » I *s r »I ar g x, | <^ 1 1 arg 


In diesem Bcrcich ubertrifft arg x\ x\ nicht 4 S< — f so dafi die Kxponential- 

ausdrucke in der Reihe Exponenten mit negativcm Realteil haben. 

*'■ » 

Sie sind daher dem absoluten Belrag nacli kleiner als Kins. 

Weiter finden wir fur |Z^..|>i 




b.. 2\(b ." - 


Wenn aber | B mn | < i ist, dann haben wir nach der Definition von B mn 


I*;*.’ ’ 


, „ ?: f F ‘ R,rr b , at ail ” ,,che mit der gleichen Metl.ode behandelt, mit der Absicht. 
eine Funktion anzugeben, die an eincr Stelle gegebene Ableitungen bat. Vgl. seine Arbeit -On 
the derivatives of a function at a point-. Annals of math, (a) vol. 18 ( 1916 ). 
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Da tier Ausdruck in der Klammer der vorigen Abschatzung eincn absolute!) 
Bctrag kleiner als 2 bat, so gilt die zulctzt aufgcscliriebcnc Abschatzung fur 
allc B mn . Daher ist 

4 — 1 *.| — 

eine inajorante der gegebenen Rcihe. Naeh Voraussetzung konvergieren aber 
die Glieder m < 2 , n < 2 in der Dopj»elreilie s gleichmftBig. Dalier konver¬ 
gieren die entsprechenden Glieder in der Reilie / ebenfalls gleichmaBig. Denn 
die eingeffihrten Fnktoren liaben einen 2 niebt (ibersteigenden absolute!) Bctrag. 
Die majorante Reibe zeigt, daQ die ubrigen Glieder («i> 2 , w>2) aucli 
gleichmfiGig konvergieren fur alle bescbrflnkten x,, x,, fiir die |argx, |<S, 

|nrg x, |<$ und o<S<-^-. Daber ist f(x, , x,) eine stetige Funktion, die 

o 

fiir x,4=0, x, :£o analytiscb ist, weil ein wenig groQer angenommen 
werden knnn. Trngt man also x, = o oder x, = o in die Reibe ein, so erhftlt man 

/(x,, o) = a.(x,), /(o, x.) = /.(x.). 

Auf abnlicbc Weisc kann man die Reilicn diskutieren, die man durch suc¬ 
cessive particlle Differentiation der Reihe fiir / erhiilt. Wir wollen nur nocb 
3/ 

bctrachten, da analogc Unglcicbungen fflr allc anderen Ableitungen Icicht 
nacb demselben Verfabren gewonnen werden kttnncn. Fulirt man die Differen¬ 
tiation explizite aus, so siebt man, daQ die Glieder in der Reibe far aus 

ox, 

zwei Termen bestehen; der eine bat denselbcn allgemeinen Typus wie in der 
Reibe fiir / un<l wird durch Multiplikation des typischen Gliedcs dieser Reibe 

mit erbalten. Dieses Glicd entstebt durch Differentiation des Faktors vor 

*1 

der Klammer. Der entspreebende Teil in der abgelciteten Reibe bat also als 
majorante die Reihe 

^ (m— 1 )! n! 

Der anderc Teil der abgeleiteten Reibe bat als typisches Glied 

— 1 

— 2 e 


m\n\ 


B mm x, 


I)a bier der Exponentialfaktor einen absoluten Betrag kleiner als 1 besitzt, so 
ist eine majorante dieser Reihe 


i*.r 

m\n\ 




Daber ist cs offenbar, daQ die ganze Reibe fur in dem glcicben 
Gcbiet gleicbmiiQig konvergiert, da die Reibe aus den drei ersten Zeilen und 
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Kolonnen gleichmiflig konvergiert. D.her ist in diesem Bereiche stetig. 

und aufler bei x, = o, x, = o an.lytisch. Setxt man x, = o, so siel.t man, 

,i a (j fur x = o den vorgeschriebcnen Wert I, (x.) annimmt. 

3x, 

Diese Schluflweise kann fur und die partiellen Ableitungen hoherer 

Ordnung verwendet werden. ’Die Funktion f(x, , x.) ist daher erne reclle 
Funktion, die in dem ersten Quadranten x,>0, x.^o mit alien ilircn par¬ 
tiellen Ableitungen stetig ist; sie ist analytisch fOr x, o, x, o. Hire partiellen 
Ableitungen nehmen fiir x, = o und (ur x, = o die gegebenen W erte an. 
Ahnlich zeigt man, dafi / die gleichen Eigenschaften in eincm jeden der dre. 
anderen Quadranten hat. Die vier Funktionen bilden natOrlich zusammen 
die durch die gegebene Reihe definierte Funktion, welchc die angegebenen 
Eigenschaften hat. 

Der Satz, den wir mit Hilfe dieses Lemmas beweisen wollen, kann so 
ausgesprochen werden: 

Satz: Es indge ein Paar von Differcntialgleichungcn 

( 2 ) =: X(x,,x,) t -jj- = X(x ,, x.) 

gegeben sein. Die reellen analytischen Funktionen A',, A', der re- 
ellen VerSnderlichen x,, x, m6gen an dem singulliren Punkt (o.o) 
verschwinden, wfthrend die Wurzeln m,, in, von (i) entgegenge- 

setztes Vorzeichen besitzen mOgen. — sei irrational. Durch eine 

mL 

eineindeutige reelle analytische Transformation mdgen die inva- 
rianten Kurven durch (o.o) in die x, und die x, Achse abcrgefuhrt 
werden, so dafl 

X, = x, (m, + P(x ,, x,)), X, = x, (rn, + Q(x,, x.)) . 

Dann kann man eine weitere Variablentransfonnation 
*\ = *. /. (x., x.) , x, = x,f (x,, x.) 

vornehmen, in der/,,/, zusammen mit alien partiellen Ableitungen 
stetig sind fur |x,|<r, |x,|<r, r> o. Es ist dabei f (o.o) =/, (o.o) 
= i. / und/ sind auBer bei x, = o, x, = o analytisch. /, und/, be¬ 
sitzen partielle Ableitungen, die bei x, = o in x, und bei x, = o 
in x, analytisch sind. In den neuen Variablen bekominen die Dif- 
ferentialgleichungen die Form 

dx\ . dx\ 

— _m,x,, —j m, x,, 

wie in den anderen allgeineinen Fallen. 
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Es leuchtet ein, dafi dieser Satz vollkommen AufschluB uber die reellen 
LOsungen von (i) gibt. 

Zum Beweis bemerken wir zunachst, dafl die Form der Differential- 
gleichung, die wir durch Einfuhrung der invarianten Kurven als Koordinaten- 
achsen erliielten, bewabrt bleibt, wenn man irgendeineTransformation der Form 

X, = £, x, = x, (I + <r (£,, £,)) 


vornimmt, in der p , <r durch konvergente Potenzreihcn in S ,, x, ohne kon- 
stanteGliedcr gcgeben sind. Wir wollen zunflchst eine passende Transformation 
dieser Art vornehmen. 

Wenn man in ( 2 ) x, = o setzt, so ist die zweite Glcichung befriedigt, 
und die erste wird 

= x > ( m < + P (*' • °)) • 

Ilieraus folgt 

' = /x,(m, + P(x„o )) - *.+«<*.))■ 

wo a(x.) bei x, = o in x, analytisch ist und bei x, = o verechwinden mbge. 
Also wird 


Durch Auflbsung erhalten wir 

x, = x, = o, 

wo f(u) analytisch ist und bei u = o verschwindet. Ahnlich linden wir 
eine LOsung 

x, = 0 , *. = *-'(1 + 9 (e m '')) t 


wo </(u) analytisch ist und fur u = o verschwindet. 

Die in Aussicht genommene zusatzliche Transformation ist 
x. = i.(i+/(!,)). X, = f.(i 

Naturlich lassen «lie so erhaltenen neuen DifTerentialgleichungen dann die 
bciden l.bsungen 

x, = e m,t , x, = o; x, = o, x, = e" 1 ' 
zu. Setzt man diese LCsungen in die transformierten DifTerentialgleichungen 
ciiif so findet man, daG 

/>(«"', o) = o, Q[<>,!*'•) = 0 

ist, so daO P(x, , x.) und Q(x,, x,) durch x, bzw. x, teilbar sind. Daher 
kbnnen wir die DifTerentialgleichungen in der Form 

( 3 ) = x,(m I + x,/?(x lt x,)), ^ = x,(m, + x,S(x I ,x,)) 

annclimen, wo li und S analytisch in x,, x, in derUmgebung von x, = x t = o 
sind. 
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Das ist die endgfiltige Form, insoweit wir sie durch eine eineindeutige 
analytische Transformation erreichen kOnnen 1 . , 

Es ist wohl bekannt, dafi die fruher erwfilmten formalen Reihen », * 
bis auf eincn konstanten Multiplikator eindeutig durch die formalen Identitaten 

t \ 


■ m ,<p = o 


y a* 


X,-£- m .+ = o 


bestimmt sind und tatsachlich lineare Glieder enthalten. Wenn die vor- 
bereitete Form ( 3 ) der Differentialgleichungen bcnutzt wird, so bemcrkt man, 
dafl in <p keine von x, unabhangigen Glieder vorkommcn kOnnen. Wenn in 
der Tat $(£,) die formale Reihe dieser Glieder bezeichnet, und man die von 
x, unabhfingigen Glieder in der ersten der obigen partiellcn Differential¬ 
gleichungen betrachtet, so findet man 

dip 


woraus £ = o folgt. Also ist <p durch x, teilbar. Ahnlich findet man, dafl 
4/ durch x t teilbar ist. Wenn wir also die linearen Glieder in <p und >1 zu 
x, bzw. x, fixieren, so sind die in diesen formalen Reihen auflretendcn Multi- 
plikatoren bestimmt 

Unser n 8 chster Schritt ist, zu zeigen, dafl die Reihen <p und die im 
Lemma vorausgesetzte Eigenschaft besitzen, dafl nkmlich die Zeilen und die 
Kolonnen konvergieren (siehe '). Aus SymmetriegrQndcn gcniigt es, diese Tat* 
sache fflr <p zu beweisen. Um zu zeigen, dafl die Zeilen von <p konvergieren, 
schreiben wir 

•P = M*«. (*•) + *• W*.-*-••) • 

Es genOgt, zu zeigen, dafi man Funktionen *.,*,,*,••• wahlen kann, die 
fur |x,|<r analytisch sind, fQr die (o) = 1 ist, und derart, dafl die partielle 
Differentialgleichung fQr <p formal befriedigt ist. Wenn man nfimlich diese 
einfache Reihe in x, in eine Doppelreihe entwickelt, so wird naturlich die 
partielle Differentialgleichung weiter befriedigt sein. Da x, das Anfangsglied 
in <p ist, so ist die so crhaltenc Doppelreihe mit der oben fixierten identisch. 
Daher besitzt die Doppelreihe konvergente Zeilen, denn die Summe der Glieder 
in der m-ten Reihe ist gerade *.,x"■*' , . Um die Behauptung zu beweisen, 
setzen wir die Reihe fur <p in die partielle Differentialgleichung ein, streichen 
einen Faktor x, un<l setzen die Koeffizienten von x? gleich Null. So erhalten wir 


(m, n + (n -+- 1 ) x, P(o , x,)) a, + ffl,x,*; + *, = o, 


wo in 4* nur 
so kommt 


vorkommen. Setzt man zuuachst n = o und 4», 


= o, 


wo c 0 durch die Festsetzung bestimmt ist, dafl ct a = 1 fur x, = o. Daraus 
folgt, dafl « 0 (x.) in dem Gebiet |x ,\<r der komplexen Ebene analytisch ist, 

1 H. Dulac bat Transfonnationen dicser Art angegeben. Vgl. seine Arbeit *Sur les 
cycles limites*, Bulletin de la Soc.niatli.dc Fiance, t. 51 ( 1923 ). 
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in (lem P{ 0 , x.) analytiscb ist. Wir wollen d.mn (lurch Induktion bcwciscn, 
dafi a„, a, • • • nacheinandcr so bcstimnit werden kfinncn, dafi sic allc in .r,|<r 
analytiscb sind, wofcm r so klcin ist, daB P uud Q fur |.r,|<r, x,|<r 
Analytiscb sind. Nehmen wir also an, dafi die Behauptung fur « ot 
riclitig ist uud beweisen sic fur u m . Zunachst ist 4*. oflenbar fur |.r,|<r 
analytiscb, da es cine linear<■ Funktion von ist 

init Kocffizicnten, die fur |a*,|<r analytiscb sind. Lftseu wir explizitc auf, 
so finden wir 



Abcr — ist fiir | x, | < r analytiscb, so dafi man m ^- in cine Rcilic ent- 
u ° “• 
wickeln kann. Bei passendcr Wald der Intcgratiouskonstantcn crbaltcn wir 
also cine Rcibe 


,T.P+ *—*» 


liir die Glicdcr in der Klammer, wo u f der Koeffizicnt von x' in dcr Rcilic 
^dr ist. Die Ncnner kOnncn nicmals versebwinden, und die Rcibe bat 


denselbcu Konvergenzradius wic die Rcilic 




Daber ist a n fflr x, < r 


analytiscb, wie wir beweisen wollten. 

Daber konvergieren die Zeilen in <p tatsHchlich. Wir wollen danu be- 
weisen, dafi Auch die Kolonncn konvergieren. Dazu sebreiben wir 


<f> = x, (£. (x.) / 3 , (x.) x, H-) 

und setzen dicse Rcibe fur <p in die partielle DifTcreiitialgleidiung cin, streicbcn 
cincn Faktor x, uud setzen den Kocffizienten von a* Null. Wir finden cine 
annloge Gleicbung 


m, x,-f- n (m, + .r. Q(x,, o)) 3. -+- * m = o, 


wo nur von /3„, • • • , abhangt. Setzt man n = o und bemerkt, dafi 
= ° ist » so 1 'i‘dct man, dafi konstant ist, wir neb men sie glcich I. 
Wir kbnnen nun nacheinander beweisen, dafi , £,, • • • fiir |x, | < r analytiscb 
sind dutch eine ganz fllinlicbe Metbode wie bei 

\Vcnn also aucli die formalen Reiben <p , \i/ divergicren mogen, so kon¬ 
vergieren gleichwohl die Zeilen und Kolonuen und stellen Funktioncn dar, 
die wie in der Voraussetzung des Lemmas fur |x,| < r, |x,| < r analytiscb sind. 
Wir macben nun in ( 3 ) die weitere reelle Transformation 


x, = <p = x, A, x = 4' = x,B, 


wo A und B reelle Funktionen sind, die zusammen mit alien ibren particllen 
Ablcitungcn fiir | *. | < r, | x, | < r stetig und fur x, o, x, 4 = o analytiscb sind, 
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uml die im Sinnc des Lemmas liings der x,- und x,-Achse dieselbcn particllen 

Ableitungen wie die cntsprechenden formalen Rcihen fur--, — liaben. So 

x, x, 

erhaltcn wir ein neues System von Diflcrentialglcichungen dcr Form 
(5) K + }> (x .. x ,)), ^- = £,(m, + Q (x,, £,)), 


wo P, Q Funktionen mit gleichcn Stctigkcits- und Analytizitatseigcnschaften 
sind. Abcr sie liaben die weitere wichtige Eigenschaft, daO P(£, , x,) und 
Q(x,,x,) mit alien iliren particllen Ableitungen nach x, bzw. x, liings der 
x,- bzw. x.-Achse verschwinden. 

Um diese wichtige Kigenschaft zu beweisen, kehren wir zu den formalen 
Idontitiiten ( 4 ) zurtick und bemerkcn, daB wcnn */>, durch die ohen be* 
mitztcn Funktionen x, A , x ,B ersetzt werden, <lie Glieder linker Hand mit 
alien iliren particllen Ableitungen stetigund fur x, 4 = 0 . x J 4°an a lytisch werden, 
w&hrend sie mit alien diesen particllen Ableitungen nach x, bzw. x, liings der 
x,- bzw. x.-Achse verschwinden mussen. Dies ist die prfizise Bedeutung 
der frQlier zur Bcstimmung von 3., /3,, • • • benutztcn formalen 

IdentibHten. 

Man verifiziert leicht, daB diese um m ,£,, m t £ t vermehrten linken Seiten 
gerade die rechten Seiten der modifizierten Gleichungcn in x, , x, werden. 
Also haben wir 

d£ - „ dx , 

—1 = m,X, + x,C t -7- = m,£, + x,D . 


wo x, C uml x, D fur die linken Seiten von ( 4 ) stchcn, die offenbar die Fak- 
toren x, bzw. x, besitzen. 

Die Funktionen C und D verschwinden mit alien partiellen Ableitungen 
sowohl liings den £,, i.-Achsen als den x,, x,-Achsen. Also haben wir 

x, A = x,P, x,B = x,Q, 

wo P, Q die angegebenen Kigenschaften haben. 

Die Differentialgleichungen ( 5 ) haben daher eine viel zweckmaBigcrc Form. 
Fine wichtige Kigenschaft von P, Q in ( 5 ) ist die folgende: Fur jedes 
n sind P, Q hdchstens von der Ordnung £^I‘: 

( 6 ) P=o(£:x:), q = o(£:£:). 

Das sieht man leicht so ein. Es mbge M (J,, £,) irgendeine mit den Kigen¬ 
schaften von P, Q sci. Wir schreiben M = x, J/\ j If* ist dann mit alien seinen 
partiellen Ableitungen stetigund furx, 4 = 0 , £ t 4 : o analytisch, und verschwindet 
zusammen mit diesen partiellen Ableitungen liings den Aclisen. Durch wieder- 
holte Anwendung dicser Tatsachen findet man 

3/ = £, M' = x,x. 
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wo M\ Jf”, • • • die glciclien Eigenschaften baben uiul insbesondcrc in ilem 
betrachtctcn Bereiche beschrnnkt sind. Diese Methode fulirt unmittelbar zu 
Ungleichungen der Form (6) fur P, Q und fur alle ilire partiellen Ableitungen. 

Richten wir nun unsere Aufmerksamkeit auf den ersten Quadranlen £, > o, 
x, >o, und wahlen wir die Numerierung so, daB m, > o und ;/i,<o. 
Maclien wir dann die Transformation 


x, = r' 


■i *i 




— ■* ■* 


ftir x, > o, x, > o, so entspricht die Umgcbung der x, bzw. x.-Achse den 
Teilen der r,, r.-Ebcne, fiir die r, bzw. r, groQ und negativ sind. Die neuen 
Differentialgleicbungen werdcn dann 


(7) = + i t „), 

wo die nnalytisclien Funktioncn 


^ = —1 +N(r,,r,), 


rn, 

m. 






gcwisse nun anzugebende Eigenschaften besitzen. ZunSchst ist fiir jedes p>o 
Dies folgt aus ( 6 ), wenn wir 

nm,>p, — nm, > p 

nehmen. Weiter hnbcn alle parliellen Ableitungen von M und N nacli r, 
und r, flhnlicbe Eigenschaften. Dies folgt aus den Gleichungcn, die M und N 
definieren, da die partiellen Ableitungen von P, Q nach £,, £, auch Un- 
gleichungen vom Typus ( 6 ) genii gen. 

Aus den Glcicliungen ( 7 ) folgt, dafi fur r,< —77, T t < — 1), D> o, 

wenn entwedcr r, odcr r, groG und negativ sind, die Steigungen 

der analytischcn Kurven in der r,, r.-Ebene, welche LOsungen von ( 7 ) dar- 
stellen, nahezu —1 ist, so daG t, + t, nahezu konslant bleibt in dem Sinn, 
daG seine relative Anderung klein ist. Wenn man daher die Glieder von ( 7 ) 
integriert ausgeliend von eincm Punkt r, = t, = r der Linie r, = r,, die 
jede solche Kurve genau einmal schneidet, so linden wir 

(9) r, =T + f+f(r,/), r, = r — t + F(r, t) , 

wo t das Zcitintervall lfings der Kurve bedcutet. Nun ist 

1 

E(r t l) = ,)dt, 
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dr t 

wo das Zeitintervall kleiner ist als 2 |r|, weil die Geschwindigkeiten -jj- , 
naiiezu — i bzw. + i sind. Also liaben wir 

E = 0(re’’)= F= 


fur ein willkurlichcs p. 


a , i < r «+ r *> 

wSJirend- 

T 


langs einer solclien Kurve nnlic- 


zu 2 ist. 

Naturlich gehOrt zu jedem Punkt des betracliteten Bereiches der r,, r, 
genau ein r und /. Nun setzen wir 


dafi ( 9 ) die Form 
( 10 ) 


T] = T + t t T\ = 

T — t, 

• . pi 

'«+<) 

(rl-O 

T . = + F 1 

, ’ 

2 

T. = * + W 

>:+0 

2 • 

(*!—O 
2 


) 


bekommt. Diese Gleichungen definieren eine eineindeutige analytisclie Bc- 
ziehung zwischen dem Teil des Quadranten r, < — D, r,< — D der r,, r,- 
Kbene, fiir den r, oder t, hinreichend groO negativ sind, und einen fihnlichen 
Bereich der r*, r*-Ebcne, und es ist fOr jedes p 




Wir bemerken wciter, daB die partiellen Ableitungen von E, F nach 
t), t \ auch diese Eigenschaft haben. Wir wollen dies allein fur die ersten 
partiellen Ableitungen beweisen, aber es bietet keine Scliwierigkeit, das durch 
vollstfindigc Induktion zu verallgemeinern. Es geniigt natiirlich, den Nach- 
weis fiir die partiellen Ableitungen von E, F nach t, t zu fuhren. 

Nun haben wir nach der ersten Differentialgleichung ( 7 ) 



IE 

~di 




M (r,, r.). 


Daher ist es klar, daB -gy und Shnlich auch ^ die behauptetc Eigenschaft 
haben. Wenn wir weiter der Kurze wegen 


dE 

a7 = u 


schreiben und bemerken, daB nach ( 9 ) 

u. 


3r, 

~37 = 1 


a f 
37 


<K 

"37 


v 


I -hv 
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ist, wfthrcnd nach der Definition von r doch r, = r, r, = r fur / = o ist, 
so folgt, daC u und v fur / =o verschwinden. Aus den Glcichungcn ( 7 ) folgt 
welter (lurch Differentiation nach r 


du 

di 


3 M, . 3 M 
(l+u) + 


(1 +0). 


dv _ IN 

dt 3r, 


(l+«)4 


dN 

3 t\ 


(1 +»)• 


Da, wic wir wissen, die Ahleitungen von M und N auf der recliten Seifc* 
sind, und da u und v fur f = 0 verschwindcn, wiihrend ( niclit 2 |r | 
im absoluten Wert QbcrtrefTen kann, so ist es klar, daD u und v fur be- 
liebige p glcichfalls 0(r *') sind. 

Untcr Beachtung dieser Tatsachen kftnncn wir die letzte Transformation 


( 11 ) 



diskuticren. Die 7.11 ( 10 ) inverse Transformation ist offenbar von demselben 
Tvpus wic ( 10 ). Also ist 

x] = I,(i + A,(r,, r,)) , ti = 1 , : 


wo die h, und allc ihre particllen Ahleitungen annlytisch und 0(r r{ '' * '*') in 
dein hctrachtetcn Bereiche sind. Wenn also die h, im Gebicte X,>o, .P, >0 
als Funktioncn von X ,, X, ausgedrQckt werden. so sind sic fiir .f, :£ °» 4 + o 
Annlytisch und sind fiir geniigend kleinc X ,, X, init alien ihren particllen 
Ahleitungen stetig. A, und seine partiellen Ahleitungen verschwindcn fiir 
X, = o Oder X, = o. In der Tat vcrschwindet h, fiir X, oiler X t = o infolgc 

0 A ()/i 

der angegebenen Eigenschaft, wfthrend gleicli ~ ist dividiert (lurch 

7 /i, r"‘ '•, so dafi und ebenso alle anderen partiellen Ahleitungen von 

/, fiir X, = o oiler X, = o infolgc der angegebenen Eigenschaft verschwinden. 
Dalier kann die Transformation von x\, x\ in X ,, X t in der Form 


** = 4(1+91(4.4)) 




gcschricben werden, wo die y, und ihre partiellen Ahleitungen in der Um- 
gebung (les Ursprunges stetig sind, langs den Achsen verschwinden, und 
fiir x, 4 = 0 » =£ 0 analytisch sind. 

Bis jetzt liaben wir nur den ersten Quadranten bctrachtet, aber natOrlich 
kann auch in jedem anderen Quadranten eine solche Transformation erkliirt 
werden, und zwar so, dafi diese Transformationen zusammen eine einzige 
Transformation bihlen, welche auch in der obigen Form geschrieben werden 
kann, so dafi y ,, g, die angegebenen Eigenschaften in der vollen Umgcluing 
des Ursprungs haben. 
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Wcnn wir nun von der Variablen £,, x, zu x,,, x, zuruckgehen, so Iiabcn 
wir augenscheinlich eine Transformation von x*, x\ zu x,, x, definiert, welche 
die im Theorem behaupteten allgemeinen Eigenschaften besitzt. 

Andererseits lassen die Differentialgleichungen in x\ , x* im ersten Qua- 
dranten gemafl der Definition von r\ , r* die LSsungen 

zu. Tragt man diese in die Differentialgleichungen ein, so zeigt sich, dafi 
sie die im Satz behauptete Normalform fur den ersten Quadranten haben und 
fthnlich auch in den anderen. 

So ist der Beweis des Satzes vollendet. 

Ks verdient noch bemerkt zu werden, daO zwar in dem Falle der Konvergcnz 
die Normalvariablen x,, x,, die die Gleichungen auf die Normalform reduzieren, 
bis auf konstante Faktoren bestimmt sind, dafi aber die Normalvariablen x,, x, 
nicht so bestimmt sind. In der Tat beweist man leicht, daB die Variablen 

*r = x;(i +% i (xr m 'xr)) <<-'•’> 

(lie gleichen Dienste leisten, wofern %,(u), %.(«*) fdr jeden Quadranten erkliirt 
sind, fOr u>o init alien ihren Ableitungen stetig sind, mit diesen Ablcitungen 
fiir u = o verschwinden und fur u >o analytisch sind. Das fQhrt natiirlich 
zu dcr Fragc, ob nicht durcli passende Wahl der %j die Normalvariablen 
weiter vereinfacht werden kflnncn. 

' Bei Ausdehnung unseres Krgcbnisses auf die anderen speziellcren, vflllig 
instabilen FSlle n= 2 und auf die vOllig instabilen Falle n>2 ist die 
erstc zu Qberwindcnde Schwierigkeit in der Bestimmung geeigneter invarianter 
Flftchen S t (k<n) gelegen. Nur wcil sich diese iu dem behandelten Falle 
n = 2 auf zwei analvtische Kurven reduzierten, konntc diese Schwierigkeit 
vermieden werden. 


Ausgcgeben aru 13. Mai. 


Btrlin. gtdruckt in d»r R*i«J>«druck»m 
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DIVERGENT SERIES AND SINGULAR POINTS OF 
ORDINARY DIFFERENTIAL EQUATIONS* 


BY 

GEORGE D. BIRKHOFF AND FREDERIC R. BAMFORTHf 
Introduction 

The analytic form of the solutions of the system of differential equations 


(1) 


_ dxi _ 

Xl(x\ t • • • , Xn) 


dx n 

*■(*., • • • , X n ) 


in a neighborhood of the point xi= • • • =x n = 0, in which all the functions 
Xi are supposed to be analytic, X t (0, • • • , 0) being zero for * = 1, 2, • • • , 
n, has been the subject of much study. This is justified by the applications 
which can be made of this form to various theories in analysis and in 
dynamics. Dulacf has simplified the problem in many cases by reducing 
the equations (1) to simple reduced forms of which the integration can be 
made without difficulty. The integration of these reduced equations then 
furnishes the solution of the system (1) either in terms of a parameter or 
in the form of a system of integrals. 

Let m< be the roots of the so-called characteristic equation which, when 
written in determinant form, is 

\dXi 

I dXi 


mb 




- 0 , 


*«-0 


where $ w -0 if i**j and 1 if i -j, and let Li represent the linear terms of X it 
i-1, 2, • • • , n, when these functions are expanded according to powers 
of X\ t • • • , x n • Three conditions which have played an important role in 
the study of the system (1) may now be written as follows: (i) if the num¬ 
bers nii are represented on a complex plane, there exists a straight line pass¬ 
ing through the origin which is such that all the points m< are on the same 
side of it; (ii) there exists a linear change of variables x, = *,(yi, • • • , y„) 
such that the system of differential equations 



• Presented to the Society, March 29,1929; received by the editors May 15,1929. 
t National Research Fellow. 

t Two papers to which reference will be made in the text are the following: I: H. Dulac, 
Bulletin de la Soci£t6 Math£matique de France, vol. 40 (1912), p. 324 et seq.; II: G. D. Birkhotf, 
Berlin Sitaungsberichte, 1929, pp. 171-1&3. 
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is equivalent to the system 


tniyi ; 


(iii) there exists among the numbers m, no relationship of the form 

(2) mi = pimi +-H 

Pj being positive integers whose sum is greater than 1. We shall refer to these 
conditions frequently. 

On introducing a parameter l it is seen that the system (1) can be re¬ 
duced to the system 


dX{ 

— - •••,**) 
dt 


(i= 1,2, ••• , n). 


The following lemma can be readily proved* by the aid of a theorem of 
Poincare’s and is already in the literature: 

Lemma A. When the conditions (i), (ii) and (iii) are fulfilled, the system of 
differential equations (3) is equivalent in a neighborhood of x i*= • • • =x n = 0 
to the system 


- mai 


(» - 1 , 2 , •••,»), 


by means of a one-to-one analytic transformation of the form 

(5) ti - M*i, •••,*») (t - 1,2, • • • , »), 

which leaves the point *i=xj= • • • = = 0 invariant. 

That this transformation is analytic is due to the fact that the formal 
power series solutions of the partial differential equations 


d$, d<t>, 

—X l + • • • + = m*< 

dxi dx m 


«- 1 . •• . *) 


converge for |*<| <r, r>0. But when the satisfy the conditions (ii) and 
(iii) and not the condition (i) in the case of real variables, the power series 
solutions of the equations (6) may diverge. Nevertheless, as has been con¬ 
jectured, t even though these series diverge there may exist transformations 
playing the same role in which the functions </», are of class C* in a neighbor¬ 
hood of the origin and are analytic at every point of this neighborhood except 


•I,p. 328. 
f See II. 
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possibly those which lie on certain manifolds of dimensions less than n. 
The existence of such a transformation has been proved* for the outstanding 
case n = 2 in which m, is greater than zero, and m* is less than zero. In the 
present paper an analogous transformation is obtained for the case n = 3 in 
which the m, are real and not all of one sign. It seems likely that the 
remaining important case « = 3 not yet treated, in which m* and m% are con¬ 
jugate imaginaries with real part of opposite sign to m\, will offer essentially 
no new difficulties. 

The case n = 3 here treated differs essentially from the case » = 2 in that 
for the former it is necessary to define two so-called invariant surfaces. The 
cases for «>3 seem to offer further difficulties. These will be discussed more 
fully in another paper. 

Since in Section I we shall make use of another lemma which is already 
in the literature,t we shall quote it here in a restricted form as 

Lemma B. When the right hand sides of the differential equations (3) 
have the form m(Xi+X u , #-l, 2, • • • , n, where X u have no terms of degree 
less than two, m u • • • , m p are greater than zero, m, + i, • • • , m„ are less than 
zero, and the condition (iii) is satisfied, there exists a transformation of the 
form 

yi= xi + ffx^x, • • • , x n ) (i - 1 , • • • , p), 

y* - x { O' = P + 1. • • ’ > ")» 

in which the functions fi are analytic in their variables in a neighborhood 
of x,+i=* • • • =-x* = 0, which reduces the differential equations (3) to ones of 
the same type in y* with the additional property that the function on the right 
hand side of any ith equation vanishes when y x = • • • =y p -0. 

I. Preliminary analytic normalization 
Consider the system of differential equations 

(7) = X£x x ,xt,xi) (» - 1,2,3), 

dl 

in which the X t are real power series in the x it have no constant terms, and 
converge for \x t \ where fi>0. The coefficients in the X, are constants 
and not functions of the parameter t. Let m v , m, and m, be thejroots of the 
determinant equation 

ax* ( | 

- mbij I = 0. 

_ & X i I *i— x t —x,—o 

•Loc. cat. H. 
f I, p. 359 et seq. 
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It will be assumed that m x and m, are greater than zero, m* is less than zero, 
and that the conditions (ii) and (iii) mentioned in the Introduction are 
satisfied. 

The object of this paper is to show that the system of differential equa¬ 
tions (7) is equivalent to the system 

(8) — = m£* (i = 1,2,3), 

dt 

by means of a one-to-one transformation of the type 

(9) - ♦<(*».*•»**) 

where the functions on the right involve the displayed arguments only, are 
zero at the origin, x,=*,-x, = 0 , are continuous together with all their 
partial derivatives within a certain neighborhood of the origin, are analytic 
at any point of this neighborhood which does not lie on any one of a certain 
set of three surfaces which pass through the origin, and have certain further 
analyticity properties. If m t were positive instead of negative and the con¬ 
ditions (ii) and (iii) were still satisfied, then the transformation (9) might be 
chosen analytic in a neighborhood of the origin as Lemma A shows. However, 
when ntj is negative, a transformation of the type just mentioned is the closest 
approach to an analytic transformation which we have discovered that will 
reduce the equations (7) to the equations ( 8 ). The equivalence of these two 
systems will only be shown to exist for a certain small neighborhood of the 
origin. 

By an “invariant surface” will be meant a surface in the space of the de¬ 
pendent variables which is made up of integral curves of the differential 
equations (7) and contains the origin as an interior point; by an “invariant 
curve” will be meant an integral curve of (7) which passes through the 
origin. Now the classical theory of the system of differential equations (7) 
tells us that there exists an analytic invariant surface which contains two 
analytic invariant curves, and that there exists a third analytic invariant 
curve which does not lie in this surface. The first sequence of transformations 
which will be used will reduce the equations (7) to a special form in which 
these three analytic invariant curves are the three axes in the space of the 
dependent variables and the analytic invariant surface is one of the co¬ 
ordinate planes. 

On account of the assumption that the condition (ii) is satisfied, we may 
assume that the right hand members of the differential equations (7) have 
the forms 

( 10 ) Xi = m i x i + X ti (x i, *,,*,) (i = 1,2,3), 
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in which the X u have no terms of less than the second degree in their argu¬ 
ments when they are expanded as power series about the origin. 

Now let us examine the differential equations (7) whose right hand mem¬ 
bers have the forms given in (10) in the light of Lemmas A and B of the 
Introduction. Since we may perform the two analytic transformations which 
Lemma B assures us exist, the first being obtained when we consider m,, m, 
as the mi, • • • , m p of the lemma, and the second when we change / to -f 
and consider m, as the m,, • • • , m p of the lemma, we may assume that the 
Xu of (10) have the property that 

(11) X u (0,0,x t ) = *„(0,0,*a) =■ *«(*!,*»,0) - 0. 

The performing of these transformations moves the third analytic invariant 
curve and the analytic invariant surface, which have been mentioned before, 
into the xi=x, = 0 axis and the x*-0 plane, respectively. 

We wish now to move the two analytic invariant curves in the x,=»0 
plane into the cohrdinate axes of that plane as well as make further simpli¬ 
fications. Since we wish to work in the Xi-0 plane, let us set x,-0 which 
is seen to satisfy the third differential equation of (7) on account of the 
equations (11). The other two equations of (7) then take the form 


d*\ 

dt 


— m iXt + Xu(xi,xt,0) t 


dxt 

dt 


=■ m t xj + X t> (xi, x*,0). 


In as much as these equations satisfy the hypotheses of Lemma A, and we 
may perform the transformation which that lemma says exists, we may 
assume that 

Xn(x lt x t ,0) - X n (x lt x t ,0) = 0 


which implies that the analytic invariant curves in the x% = 0 plane are the 
coordinate axes. 

Since, on account of the equations (11), which are still true after the 
transformation just mentioned has been performed, Xi = 0, Xi = 0 satisfies 
the first two equations of (7), we may assume that X M (0, 0, x») =0, because 
if it were not so we could make use of an analytic transformation of the 
form 

yx = *i, y* = *>, y% = *» +/(x.), 
whose existence is assured by Lemma A, to make Y n (0, 0, y>) =0. 
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As a result of this discussion we may state that without loss of generality , 
the differential equations (7) may be assumed to have the form 

d x i 

— = Xi(Wi + XtPx) + XsXgQi, 

dt 

dxt 

(12) — = x f (mj + x t Pt) + xix^j, 

dt 

dx i 

—- = *i(«i + P* + xtP 4 )» 
dt 

where the Pi and Q t are analytic functions of their arguments x, in a neighbor¬ 
hood of the origin, for |x<|£r, r>0, i = 1, 2, 3, Qi is a function of x, and x, 
only, and Q t is a function of x x and x, only. 


II. Convergence properties of formal series 


Let us recall that the aim of the present paper is to find three functions 
0 <(x 1 , x t , x,) such that, when the Xy are replaced by an arbitrary solution 
xy (0 of the differential equations ( 12 ), x*- 0 ,[xi(/), *j( 0 ) satisfy the 

differential equations ( 8 ). Since all three of these functions can be found 
in exactly the same way we shall confine our attention for the present to 
4>\. Evidently 4> 1 must be a solution of the partial differential equation 


(13) 


— (xi(m» + x»P t ) + x,x»()|] + -[x*(mj + x a P f ) + X|Xj Q t ] 

dxi dx t 

+ T~[**(»*j + x x Pi + X* P 4 )] » mi4>x. 
ox a 


Equation (13) has a formal power series solution whose only linear term 
is ax 1 , where a is an arbitrary constant. This power series solution is unique 
if the coefficient of x, is chosen as 1. Let us assume that this has been done. 
Furthermore, every other term has a factor x,x, or x,x, as is readily seen by 
alternately setting x,=x, = 0 and x, = 0 in (13) and solving for 0 ,. Let this 
formal series be 


O 4 ) 0. ~ *!+••• + xPxfxf + • • • , 

TtX *fl .v , 

which obviously can be written in either of the forms 

(15) 0i~ao(*i,x,) + a \{x\,Xi)xi + • • • + a p (x lt x t )xf + • • • , 

or 
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(16) 0i ~ <*oo(*») + «io(*«)*i + • • • + a«,(*j)*i"*a" + ’ ' • 

We shall now prove 

Lemma 1. The series a p of (15) and the series a mn of (16) all converge 
for |*i |, I*, | <r and for |x,| <r respectively. 

Substitute the series (15) into the equation (13) and equate the coefficients 
of like powers of x,. Evidently a 0 ~*»> and <*i satisfies the equation 

(17) —xitni + — xtm, + «i(m, - mi + x x P t o + x t Pf) = *i, 

dxi dx t 

where Pn-Pfeu **> 0) and <!>i is linear in a 0 with coefficients which are an¬ 
alytic in z, for |x, |, \x t \ £ r, and are zero for *»=*, = 0. 

Set 

(18) cri - 0o(*») + • • • + &(**)*»* + • * * » 

where the fa are formal power series in *,. Then fa formally satisfies an equa¬ 
tion of the form 

—rim* + ^o(ffli - «i + xtPioo) ™ *10, 

dx t 

where P 40 0 and «*>, 0 are analytic in x, for |z, | £r. On account of the condition 
(iii) of the Introduction being satisfied, it readily follows that fa is analyticf 
in x, for |x,| gr. In a similar manner it can be shown that all the are 
analytic in *, for |x,| £r, and if a, be arranged according to ascending 
powers of x* it can be shown in the same manner that the coefficients are 
analytic in x, for |*i| £r. 

Now define 

B, ® 0o + &\X\ + • • • + 0.-iXi* -1 

and 7 , =a, - B .. Evidently 71 has a factor x,', and hence, writing 7 , = x,* f 1 , 
we see that {1 will satisfy an equation of the form 

—xirni + —Ximj -f {i(m, - mi + sm, + XiPjo + x t P 40 ) 53 * 11 , 
dx\ dxt 

where $>,, is analytic in x,, *, for |x, |, |x, | £r. Let 5 be large enough so that 
m,+m,(j-l) is positive and define m K as equal to this quantity. Let m be 
a number that is greater than zero and less than the smallest of m x , nh 
and m t . Then 

| £mi1+ <?m, + m 4 1 > m(p + q + 1) 

f Cf. H, pp. 177-178. 
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for any pair of positive integers p, q. Choose M> 0 and such that x x Pto 
and xtPi o are dominated by 


(19) 


M 


(-tH'-t) 


- M, 


and 4>n is dominated by the first term of (19) only. Then the power series 
£, is dominated by the power series solution of 


( 20 ) 


/ V . a/\ , r m ,/i 

i( Xx- h Xt—i = j I - m + — —-- - M 

K ' ' L (-"X-f) J 


M 


(-7X-7) 


On account of the symmetry in (20) with respect to x x and Xt the power 
series solution of (20) converges for |x 3 |, |xj| <R, where R is the radius of 
convergence of the power series solution of the differential equation 



From this it follows that /?~r. Hence a x is analytic in x x , x* for |xi|, |x*|<r. 

The proof of the lemma for the other follows readily by induction 
since every a { satisfies an equation of the form (17) where on is replaced 
by on, nt 3 by im 3 , and 4>i by where is a polynomial in ao, • • • , a<-i, 
with coefficients which are analytic in x h x, for |x» |, |xi | gr. 

The a m „ can be proved to be analytic in the same way that the /3< of 
(18) are proved analytic. This completes the proof of Lemma 1. 

Similarly, we may prove that when the series corresponding to </> 2 and 
<t >3 are arranged as power series in x, the coefficients are analytic in Xi, x* 
for |xi |, |x a | <r, and when arranged as power series in Xi, Xj the coefficients 
are analytic in x 3 for |x 3 1 <r. The only linear term in <f> 7 is the one involving 
x 2 , and every other term contains as a factor XiX, or x 2 x 3 . The coefficient of 
the linear term is arbitrary and when it is chosen as 1 the series for fa is 
unique. Let us suppose that this choice has been made. The series for 
fa has properties similar to those of fa and fa with the additional one that 
x 3 is a factor of every term in the series. We shall choose the coefficient of 
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the linear term as 1 and this choice uniquely determines all the others. 

If the formal series for <t> { , i = 1, 2, or 3, should converge for |x<|<r 0 , 
r 0 >0, i = \,2, 3, it can evidently be used to reduce the tth differential equa¬ 
tion of (12) immediately to the normal form dzT/dt = m^i*. 

Even though the series for the 4> t should diverge, it will be found that 
they can be “fitted” by functions of class C" in a manner described in 


Lemma 2. Let the formal series 

S(x = £ — rTr, x ~ x * x * » 

in which all the a mnp are real, have the property that when it is arranged as a 
power series in x t , i.e., 

(21) S = <*o(xi, xj) + ai(x,,x t )xi + • • • + a,(xi t xt)x?/p\ + • • • , 

the a p are analytic in the complex variables x x , x, for \x x \, |x, | gr, r >0, and 
when it is arranged as a power series in x Xt x,, i.e., 

(22) S = aoo(xj) + a, 0 (xj)xi + • • • + a mn (x$ )x x m xf/(m\n) + • • • , 


the a mn are analytic in the complex variable xj for \x t | Then there exists 
a real function F(x x , x a , x*) of the real variables x x , Xj, x» which is continuous to¬ 
gether with all its partial derivatives for |xi|, |xi| f |x, |^r, is analytic for 
x x j*0, Xa^O, Xi^O, satisfies the equations 


(23) 


dxrdx 




d’F I 

— " a,(x x ,x t ) ( m,n,p 



and has the property that it and all its partial derivatives are analytic in x { 
and Xj for |x<|, |x, | £r, 0, x,*0 f x* = 0, iv*j t i*k,jj*k, and are analytic 

in Xjfor \x { | £r, x f = x* -0, i,j, k -1, 2, 3. 


Consider the seriesf 


(24) F(x x ,x t ,x t ) = X) xfxf xf (1 - g~ 1/fl "">), 

m.*.p-omln\p\ 

where i-1 if m£n, 2 if m<n, b mmp -l+\a mm ,\, B^^b^^xp. At 
first we shall consider this series only in the six-dimensional complex 
region defined by 

(25) \*i\$r, | arg x/| g 0, 0 < 6 < x/8 O' - 1,2,3). 

In this regio n |arg x}x} | does not exceed t/2, so that the exponential 

t Cf. II, p. 173. 
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terms in the series have exponents — 1 /B mnp with negative real 

parts, and so are less than one in absolute value. 

Moreover, we find for | B mnp | ^ 1, 


0mnp(l - | = I a mnp |- 


1 


1 


+ • 


B mnp 2 \BL p 

g | a.*, | • (e - 1)/1 B m « p | £ 2/[ | *, |* | |*J. 


But if, on the contrary, \B mnp | < 1, then, by the definition of B mnp , \a m n P | £ 
1/[|*|* |x<|*]. Since the quantity in the parentheses in the preceding 
inequality is obviously less than two in absolute value, we conclude that the 
inequality holds in this case too and so without restriction on B mnp . Thus 
the given series is dominated by the series 


(26) 

where 

(27) 


at.H.p— 0 ftiyi'P'. 

m, = »n-2ifm^» and m if m < n, 
«i «n — 2 if m < n and n if m £ n. 


But it follows from the hypotheses concerning the a p that that part of the 
series (24) for which p£3 converges uniformly in the region (25); and it 
follows from the hypotheses concerning the a** that that part of the series 

(24) which gives rise to a negative m, or to a negative n x converges uniformly 
in the same region. Hence the series (24) converges uniformly in the region 

(25) , and an examination of this series will readily show that it is analytic 
except for x x - 0, x* * 0, or for x t * 0, and that 

F(x i,*i,0) — a<o, F(0,0,x$) = a 0 o. 


In a similar manner, all the series obtained from F by partial differentia¬ 
tion can be shown to have the same properties of analyticity and con¬ 
tinuity as has F in the region defined by (25). This is on account of the 
fact that thear,, of (21) and thea wn of (22) are analytic for \x x |, |x* |, \x» | 
and that for a finite number of differentiations of (24) only a finite number 
of negative powers of x x and of x, are introduced into the dominating series 
which correspond to these derivative series in the same way that the series 

(26) does to F. We shall consider only dF/dx it 


dF 

dx, 




1_0 m\n\(p — 1 )! 


+ i 


a-np /- 2*-»'*• 

xfx t n x,* 


m\n\p\ 


/ - 2g- t '*-«A 

V B mn ,x, A 
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The first of these series is evidently dominated by 



2|«i|-| *»M *»!*-» 

m!n!(p - 1)! 


and the second by 



21 x,|-| i,I — 

m\n\p\ 


From the remarks made above concerning a p and a,„ etc., it readily follows 
that dF/dxi has the same properties of continuity and analyticity as has F 
itself. Furthermore, 


dF 


dx» 


= <*!• 

«i-0 


F(x i, Xi, x,) is thus a real function in the octant Xj^O, x*?>0, x,^0, 
continuous together with all its partial derivatives in this octant, analytic 
for Xi>0, xj>0, x s >0, and satisfies the equations (23). Similarly F is seen 
to have the same properties in each of the other octants and the eight 
functions thus defined evidently unite to form a function F(x Xl Xt, x,) defined 
by the series (24) and having the properties stated in the lemma down to 
and including those relative to the equations (23). To prove the last state¬ 
ments made in the lemma concerning the function F and its partial deriva- 
atives we need merely use dominating series in a manner wholly analogous 
to that in which they were used above. 


III. Final analytic normalization 

The problem in hand is more readily discussed if the functions Q x and 
Q* appearing in the differential equations (12) have the factors xjx s andxix,, 
respectively. We shall now show that the assumption that such is the case 
can be made without the introduction of any further hypotheses than have 
already been made. To this end let us consider the transformation 

( 2fi ) y< = s t (x i,*t,*«) (• - 1,2,3), 

where the series s t (x h x,, x,) consist of all the terms of the formal series for 
0„ *'=1, 2, 3, respectively, involving Xi"**“, m+« = 0, 1, • • • , X + l, all the 
terms involving x,*, ^ = 0, 1, • • • , X + l, and no others. For the present X 
will be considered as any integer greater than 1 but will be determined in a 
later section as depending on the m,. The properties of convergence of the 
coefficients of the formal series for the when these series are arranged ac¬ 
cording to powers of x 3 or according to powers of x x and x* imply that the 
series converge for |x, | <r. 
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From the definition of the s t it follows that 


— -[x x {m x + xiPy) + xtx tQ x ] + —— [xj(mi + xiPt) + XiX,Qt] 
dxi dx * 


dSi 

+ —— xi(mi + x\P% + XtPi)— <*• 
dx» 


define functions Ri(x x , x*, x a ) which have no terms of degree less than 
X +2 in x, and less than X+2 in x x and a* together. Since the only linear term 
in Si is x^, i-l, 2, 3, respectively, when we solve (28) for x<«/<(yi, y*, y«) 
the only linear term in /< is y if i = 1, 2, 3, respectively. Hence the R t when 
expressed as functions of y/ have no terms of degree less than X + 2 in y* 
and less than X+2 in y x and y t together. Thus, when x, are replaced by an 
arbitrary solution x<(/) of (12), the corresponding y, satisfy the system of 
differential equations 

(29) ^ «- 1,2,3), 


where the arguments of the R t are the y,. But every term in the formal 
series for 4> x , except the linear term, contains xix t or xjx* as a factor, and 
corresponding statements are true concerning the formal series for the <£> 
and 4 >«. This implies that when x, = 0, then y, *= 0, x x « y, and x* = y*, and when 
Xi—Xj—0, then yi—yi*0 and xj~y a . Thus the transformation (28) will 
preserve the form of the differential equations (12) and hence the form of 
the system (29) is the same as that of the system (12) when the x are replaced 
by the y<, »-l, 2, 3. Furthermore, on account of the properties of the Ri we 
see that the new Q x will have yfyf as a factor and the new Q t will have y x x y» x 
as a factor. Hence we are not losing in generality when we assume that the 
differential equations under discussion have the form 

dx i 

— = x x (m x + xi P x ) + xfxfQ x , 
dxt 

< 3 °) — = *.(**» + x t p t ) + x{ xSQ,, 

at 

dx t 

— - x,(m, + x x P» + x,P 4 ), 

where the and Q. are analytic in x, for |x, | ^r, r >0, j= 1, 2, 3, Q x is a 
function of x, and x, only, and Q t is a function of x, and x, only. Further¬ 
more, every P, has x 3 x+1 as a factor, and when expanded as a power series in 
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Xi and *2 contains no term of degree less than X + l in x, and Xj together; 
Qi contains x 2 x x t x as a factor, and Q t contains x x x xf as a factor. These facts 
will be used in the next section. The partial differential equation (13) and 
the corresponding ones in fa and fa will now be replaced by the system 

— [x,(m, + X> Pi) + + “-[**("»* + x * p *) + *i**J*(?i) 

dx t °x t 

W) 

H--[x«(m, + x x Pi + xtPi)] = m t fa (t = 1,2,3). 

dx» 

IV. Construction of auxiliary invariant surfaces 

It happens that if @, = Q, = 0 we can fairly readily obtain solutions of 
this system (31) of class C* in a neighborhood of the origin, and thus solve 

our problem. Hence we shall try to find a transformation of the form 

(32) Xi - Xi(y lt yt,yi) (*' “ 1,2,3), 

which will reduce this system of partial differential equations to an equivalent 
system in which the coefficient of dtpjdy, has y> as a factor, », j = l, 2, 3. 
Evidently we may choose x* -y» as the last equation in this transformation. 
Now if the system of partial differential equations when expressed in terms 
of the y, have this additional property, the transformation (32) will take the 
integral curves of the system of differential equations (30) into the integral 
curves of a system of differential equations of the form 

“ yt(mi + Ti) «- 1,2,3). 

at 

Hence our search for such a transformation (32) is connected with the 
search for two invariant surfaces of the system of differential equations (30) 
other than the known one *a = 0. In this connection we shall prove 

Theorem 1. There exists a function /i(x,, x,) which is of class C* in a 
neighborhood of x 7 = x t = 0, is analytic in x,, x t for Xjj*0, x,^0, is, together with 
all its partial derivatives , analytic in either variable when the other variable is 
zero , and is such that 

x x = xf xffx(xt,Xi) 

is an invariant surface. There also exists another function gx(x lt Xj) with 
properties analogous to those of fx(x Jt x t ) such that 

x t = x? xfgx(xx,xt) 

is another invariant surface. 
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Since the method to be used for the determination of one of these in¬ 
variant surfaces can be used for the determination of the other, we shall 
confine our attention to that surface which will be transformed into the 
y, = 0 plane. To this end consider the transformation 


(33) 


yi 


Xi - xfxtf x (xi,xi) t y» = x it y , = X tl 


where the function f x is to be so determined that when the x< are replaced 
by an arbitrary solution x<(/) of (30) the y x (t) so defined will satisfy a differ¬ 
ential equation of the form 


(34) 


dy x 

— = y i(«i + T x ) t 

at 


in which T x is a function of the y< of class C* and T x (0,0,0) =0. On differ¬ 
entiating the first equation of (33) with respect to /, replacing the derivatives 
dxi/dt by their values given in the equations (30) and x x by its value given 
in (33), and noting that yi = 0 is to be a solution of the resulting differential 
equation on account of the required form (34), we obtain the result that 
Mxt, Xi) is a solution of the partial differential equation 


(35) 

in which 


df x a/i 

T—*»( m * + x » R i) + T— x»(m» + x t Rt) 
dx t dx» 

“ /i( m « “ 2x,/?, x,/?,) + 


w 4 = m, - 2m, - 2m,, /?,«=/>,+ xfx t y x *Q t , 

Rt - x t xtf x P % + P it R s -= p x - 2 R u 


where the P< and the Q t are the same functions of x}x}f Xy x,, x, as they were 
of x x , xt, x t in (30). From the fact that every P t has x, x +‘ as a factor and when 
expanded as a power series in x, and x, contains no term of degree less than 
X + l in x, and x, together, and from the form of the R t it follows that every 
Ri has xj x+, x, x+l as a factor. If /i(x,, x,) is any solution of the equation (35) 
which is of class C- in a neighborhood of the origin, the function y t so defined 
by the first equation in (33) will satisfy an equation of the form (34) when 
the x.- are replaced by an arbitrary solution x.(/) of the differential equations 

30 • faCt ,S an immediate consequence of the manner in which the 
partial differential equation (35) has been derived. 

We shall now prove 


453 



128 


G. D. BIRKHOFF AND F. R. BAMFORTH 


(January 


Lemma 3. There exists a solution /i(xj, x*) of the partial differential equa¬ 
tion (35) which is of class C® in a neighborhood of the origin Xa = Xj = 0 and 
is analytic at any interior point of this neighborhood for which x*?*0, x 2 ^0. 
Furthermore , this solution and all its partial derivatives are analytic in either 
variable in a neighborhood of Xi = 0, i = 2 or 3 as the case may be, when the 
other variable is zero. 

From what has gone before, Theorem 1 will be obvious when Lemma 3 
has been proved. 

The first thing to be observed is that there exists a unique formal power 
series solution of the partial differential equation (35) which may be written 
in any one of the three forms 

fi ~ floo +-1-H- 

m\n\ 

(36) = ao(xj) + • • • + a„(xj)xj" + • • • 

■ 0o(xi) + • • • + Pn(x»)x *• + • • • . 

Since every R , and Q x has Xt x Xi x as a factor, this formal series for f x also has 
x* x x a x as a factor. If this series does not diverge for every pair of values 
x 2 ?^0, x a ?*0, our lemma is obviously proved, so that we shall confine our 
attention in the following discussion to the contrary case. By making use 
of the methods employed in the proof of Lemma 1 it can be shown that all 
the a m (x 7 ) are analytic in Xi for |x* | <r and that all the £„(*») are analytic 
in xj for |x 2 1 <r. This follows from the fact that wherever f x appears in P t 
or in Q 3 it is multiplied by xfxf. Now define a function g(x t , x») which 
corresponds to the formal series for f x in the same way that the function 
F corresponds to the series 5 in Lemma 2, it being supposed for this applica¬ 
tion of this lemma that no x x appears in the series 5 and that that part of the 
function F which is independent of x* is replaced by the corresponding 
analytic series which is a part of S. On making the transformation /i=/+g 
we see that f x satisfies the differential equation (35) if and only if / is a solu¬ 
tion of the partial differential equation 

(37) -~—xt(mt + x>Si) + -~—xi(mt -f x>5*) = /(m 4 + St) + S i} 
dx 7 dxj 

in which the 5, are well defined functions of /, Xt, x 9 , S$( 0, 0, 0) being zero and 
St being independent of /. These functions are given explicitly by the follow¬ 
ing equations (38) and have the properties that they are of class C® in a 
neighborhood of /=xt = xa = 0, are analytic in / in a neighborhood of f =0 
for x 2 and x a in a neighborhood of Xj = x a = 0, and are analytic in Xj, x> in a 
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neighborhood of xj=x 3 = 0 for x 2 ^0, Xj^O and / in a neighborhood of/=0. 
All these properties are immediately implied by the equations 

Si(/,xt,x,) = Ri(J + g,x 2 ,x t ), S 2 (J ,x 2t x%) = R 2 (J + g,xt,x,), 

1 ( dg 

S$(f ,xt,Xi) = — 2 x 2 R 2 + x% R% -s- xjx 3 (Ri — Rio) 

/ idxs 

(38) + p-x,x,(R, 2x,(R, - R ,„) 

dx t 

+ x,(R,-RJ] |, 

Si(f t x t t x i) = Qi ~f" 2x 2 R 2 o ■+• x 2 R 2 o) - x 2 (m 2 -\-x 3 Rio) 

dx 2 

dg 

— - — x 2 (m 2 + x 2 R 20 ), 

dx , 

where the arguments of the R t are /+g, x 2 , x 2 and R i0 = Ri(0+g, Xt, x 3 ). 
Evidently every 5, has Xi x x 3 x as a factor. This fact will be made use of later 
on. From the definition of g(x 2 , x 3 ) it follows that every partial derivative 
of Sa is zero when either xj or x 3 is zero. Hence for every pair of positive 
integers p, q a function S pq can be defined which has the same properties 
of continuity and analyticity that S A itself has and satisfies the equation 

(39) Si - x t ’xt'S M . 

In other words, Si admits of the factor x,*x 3 « for arbitrary p and q. This 
fact can readily be established by taking the factors x, and x, out of 
Si one at a time, e.g., for x 2 *0 the function Si/x* has all the properties of 
continuity and of analyticity that S 4 itself has. The limits as x*-*0 of 
Si/ x, and all its partial derivatives exist and are zero. Hence S 4 /x, defines S i0 . 
Now let us consider the system of ordinary differential equations 

— = /(m 4 + S 2 ) + Si, 

< 4 °) = **(”**+ 

at 

dxt 

— - x 3 (mj -f XtS 2 ). 

If we can determine a surface/=/(x*, x,) in the/, x*, x, space which is made 
up of integral cunres of the system of ordinary differential equations ( 40 ), 
such that/(x 2 , x 3 ) is of class C° in a neighborhood of x 2 = x 3 = 0, and is analytic 
at any interior point of this neighborhood for which x*^0, x 3 ?*0, the func- 
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tion f(x a, x a ) so defined will be a solution of the partial differential equation 
(37) and will define by means of the equation/i=/+ga function/i(xa, x a ) 
which will be a solution of the partial differential equation (35). Thus if we 
can find such an /(x t , x t ) our lemma will be proved. 

Suppose for the sake of definiteness that m A is greater than zero. We 
shall limit ourselves to a closed region R about / = xj=x 3 = 0 in which 
\xt\ <1, i = 2, 3, m t +Si and mj+XaSi are greater than zero, and m a +x*S, 
is less than zero. We shall suppose that R is further limited so that for 
(J, x a ) any point in this region the functions Si are analytic in /, of class 
C" in xt, Xt, and are analytic in x*, x a if Xt^O, x a ^0. That this is possible 
follows from the properties of the S { stated above. The following notation 
will be adopted for this region R: 

(41) " maX = max ^ w * = max + *•$*)» 

A^i = min (m 4 + 5 a ), N t - min (m a + x*Si), JV a = min (m a -f x>S s ). 

Now consider the surface defined by the integral curves of the differential 
equations (40) which pass through the line 

(42) / - 0, Xj = Xa - r, r £ 0, 

and suppose that / = 0 on this line. The equations of this surface may be 
written as 

(43) / -/o(*,r), Xt - X|(f,r), x # - Xa(/,r), 

where the functions appearing on the right hand sides of these equations, 
on account of the analytic properties of the S it are analytic in their argu¬ 
ments so long as r^O. Until further notice, it will be understood that the 
discussion from now on is relative to that quadrant of R for which x* and 
xa are positive. We wish first of all to show that there is an open region R 0 
of the x Jf x 3 space which is bounded by the axes x 2 = 0, Xa = 0 and by the arc 
of a circle, having its center at the origin and its radius different from zero, 
and is of such a nature that, when (xj, x a ) is any point in it, the last two 
equations of (43) can be solved for t and r as single-valued functions of Xa 
and x a . For such a region these functions / = /(x*, x t ) and r = r(x s , x 3 ) will 
evidently be analytic, and when they are substituted in the first equation 
of (43) we shall have /=/(x t , x 3 ) where /(x,, x a ) is an analytic function of 
its arguments for (x 2 , x a ) in Ro. 

To this end let us find how the t of any point of the surface (43) depends 
on the corresponding r. From the second equation of (40) we evidently have 

log x,-logr = J ( m t + xtSi)dt. 
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Since x*<l on account of the hypotheses concerning the region R, for 
<^0we may write -logr^iV,/ where N, is defined in (41). A corresponding 
inequality exists for t £0, i.e., -logr^ A/*/, so that we know that 

(44) 1*1 — * I log t | , 

where n is the greater of \/N, and - 1/Af*. 

Now for T7* 0, and (/, x t , x,) in R, the partial derivatives dj/dr, dx^/dr, 
and dxi/dT satisfy the equations of variation 


(45) 


a 

j/ _ 

a/ 

ar 

a 

ax* 

a/ 

dr 

a 

dx. 

a/ 

dr 


3 

—<i 

dr 

d -L 

dr 


dx, dx, 

V' r ” + ~d7 aTu ' 


dx. 


dx, 

-c 

dr 


'T„, 


dx, ax* 

ai + —*>T„ + —(m, + wT„), 


dr 


dr 


where w-x* x “ , Xa x - , and the T {/ , since every S { has x* x x, x as a factor, are de¬ 
fined by the equations 


0 )Tn - S,+JdS,/dJ, uT x , - fdS,/dx, + BSJdx *, «r„ = fdS,/dx, + dS,/dx t> 
o)T u - x,x,dSx/df, uT n - x£ x + x,x#S x /dxt t wiT*, » d{x,x£ x )/dx, t 
uT,i - x,x,dSt/df, <*t„ - a(**x»s*)/ax tl a>r„ - x*s* + *iT*a5,/ax,. 
Now for / - 0, a//dr - 0, dx,/dr - dx./ar = 1. With this in mind let"us set 

(46) * “ exp [ f Q (w ° + v “ cxp [ f o J , 

« - a//ar, t> + { = dx,/dr, w + T, ~ dx,/dr. 

Hence, when * = 0, y=t> = u; = 0. Now make use of the transformation (46) 
in the equations (45) and obtain 

du 

~ = u(m 4 + uT u ) + (® + {)«r M + (u> -f f )»r Mf 
dv 

( 47 ) ~ = + uT„) + (w + 17 )o>r„, 

au> 

— = ««r t i + (p + ()wr w + w(m, + uT„). 

Hence 
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u = exp [X (>»* + f o exp [-/: {m 4 + 

t ’ [(» + £)«?'»* + + i?)wTia]d<, 

(48) v = £ J* + (w + >7)«ria]rf/, 


in 


»? J •n-'luwTn + (v + $)uTn]dt. 


We shall limit the discussion for the time being to that part of the first 
quadrant of the x&t plane which corresponds to t^O since an argument 
similar to that about to be given can be made when t £0. Define U = max | u | 
for r fixed and / variable, and make corresponding definitions for V and W. 
Also define the constants A/</ so that «© > M it >n max |x* x ~'T ti | in R, where 
n is that in (44). Since x» decreases when t increases, we have for this region 
that x, is less than its corresponding r. From the first equation of (48) we 
obtain 

U s W'W„|t»-‘+ | e“''M n + M 11 ) r* -1 ] | log t | 

(49) S + Wlt„ + + Wu]r»-‘ | log r | 

a r—.+»-• | log x| [VM„+ WM» + r-'“<M n + AT,,]. 


In a similar manner from the other two equations of (48) we obtain 
V a | log r | [ UM„ + WM» + Mu), 

W a r ny,+x-1 1 log r | [VM» + VM„ + 


For i £0, if X were to satisfy the inequality 

(51) X > 3 + n(A/i + M t - Ni) 

we could show that U, V and W approach zero as r—»0 and hence that the 
partial derivatives dxt/dr and dx,/dr are greater than zero for r less than a 
certain number which is greater than zero. 

The inequality which corresponds to (51) for t£0 is the same as (51). 
Thus it follows from the definition of n and from (51) that it would be 
sufficient if X were to satisfy the inequality 

(52) X > 3 + (A/i + M t - N*)(l/N t - 1/Af,). 


But M lt Mi and N 3 depend to some extent on the choice of X. Let us choose 
X so large that 

X > 3 + L(mi + — w*j)(l/n* — 1/mj)* 

* X will also have to satisfy another inequality which arises from the determination of the other 
invariant surface. 
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where L is any positive number greater than 1, and then choose R so that 

(52) holds, which evidently can be done. 

But, since r < 1, the max |logr |r for the region R is when r = e~ l . Thus for 
r<e~ l we may write the inequalities (49) and (50) as 

u £ r(VM lt + WM lt ) + r(M lt + M„), 

(53) V g r(UUn + WMu) + rM u , 

W Z r(UAf$i + VM») + tM\\. 

These inequalities may be written in the form 

U - rMxtV - rM lt W £ r(M lt + Af lt ), 

- rMnU + V - tM u W £ rM sa, 

- tM u U - rAf»V +W£ rM lt . 

Since the determinant of the coefficients of U, V, W in the left hand members 
of these inequalities is definitely positive for r small enough and since each 
of the right hand members contains rasa factor and the M { , are constants 
independent of r, we see that for r small enough U, V and W are as small as 
we desire. 

This argument, with exception of that part of it relative to the choice of 
X, has been made for 0. A similar argument can be made for / £ 0 and hence 
as is seen from the equations (46), for r small enough dx^/dr and dx s /dr are 
greater than zero for (/, *,) in R and, as is still assumed, *,>0 and x a >0. 

Now examine the jacobian for the last two equations of (43). In R and 
for Xi>0 and x>>0 we have always dxt/dt greater than zero and dx 3 /dt less 
than zero. From these facts and those proved in the preceding paragraphs 
we see that we can find an open region R 0 defined by xj>0, *,>0, xf +xf 
<2r 0 * where r 0 is a constant greater than zero but so small that the jacobian 

I dxt/dr dXi/dr 1 
I dxt/dt dxi/dt I 

is always less than zero. Hence for this region R b we can always solve the 
last two equations of (43) for / and r as analytic functions of x* and x, and 
thus obtain/=/(xj, x 3 ) as has already been described. 

We wish now to show that the limits of/(x„ *,) and all its derivatives as 

*J~*° °f M f* - ’ 0 are zer0 - To d0 ^s let us note that proving any function 
G(x 2 , xj) satisfies the relationship 

($*) G(x t , xj) = 0(x? xj«) 
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for an arbitrary pair of positive integers p, q is equivalent to proving that 
it satisfies the relationship 

(55) G(x t ,x t ) =OM 

for an arbitrary positive integer r. It is understood that x, and x, are posi¬ 
tive and that r is the value of the parameter which corresponds to the point 
(xt, Xi) by means of the last two equations of (43). 

First of all let us observe that the last two equations of (40) imply that 


(56) r /Xt g r/x, £ r«*. 

Now consider that part of the quadrant in the XjXj plane which corresponds 
to /^0. For this region x,£r, and the equation (54) implies the existence 
of a positive constant M , such that |G(xt, x«) | gxi«3/„ where q is any posi¬ 
tive integer. Hence |G(x*, x,) | ^r«AfIn the same way we can show that 
for that part of the quadrant which corresponds to t £0 there exists a posi¬ 
tive constant M p corresponding to any arbitrary positive integer p such that 
|G(xj, x,) | £r p M p . If we define as the greater of M p and M q we see 
that for t positive or negative 

(57) | G(Xj,X|) | $tMrr 

for an arbitrary positive integer r, the M„ being a positive constant depend¬ 
ing on the r. Hence the relationship (54) implies the relationship (55). 

Now consider the converse situation. The equation (55) implies the exis¬ 
tence of a positive constant M tr and a relationship of the form (57) for an 
arbitrary positive integer r. On using the inequalities (56) we obtain the 
inequalities 

|Gl r’ r« 

- -L £- £ M rr T r -r-*-” ,M '+* nN '. 

XfXi* Xt Xt 


If we choose r so large that r-p-q-pnM t +qnNi>0, the quantity after 
the last inequality sign is finite and hence we have the result that the rela¬ 
tionship (55) implies the relationship (54). 

We are now in a position to discuss the behavior of /(x,, x,) and of its 
partial derivatives as x*—*0 or as x,-*0. From the first equation of (40) and 
from (39) and (42) we have that 


/ = exp 


whence 


(m 4 + Sa)rf/J J* exp£— J (m« + 5i)d/Jo(r r )tff, 
* r-". | O(r') | n | log r | £ | 0(r~*<-») | 


for r an arbitrary positive integer. Hence/(x,, x,) =0(x 2 »’x,«) for arbitrary 
positive integers p,q. 
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In order to discuss the behavior of d//dx, as £3— *0 or as £,—*0 we shall 
make use of the fact that 

df _ df dt df dr_ 

^ ^ dx, dt dx, dr dx, 

Since /(x,, x,) = 0 (x 3 p x,«) for arbitrary p, q as has just been proved, the first 
equation of (40) shows that df/dt has the same property. Now consider the 
equations (45) once again. From them we have proved that dx,/dr behaves 
like £ of (46) for r small enough and hence for x, or x, small enough. Hence 
dx,/dr, as x,-*0 or as x,-*0, is at most infinite like r - ""*. Similarly we have 
dxa/dr = 0(r nN> ). Now we are in a position to examine the behavior of dr/dx, 
and of dt/dx*. To do this let us consider the identities 



1 

dr dx% 

dr 

dx, 

(59) 

0 

dx 3 dr 

dr dx j 

’ JZ 

dr 

dr 

dx, 

and 


" dx, dt "* 

9 

dt dx, 

dxi 

dt 

dt 

dx, 

(60) 

1 

n 

dx, dt 

dt dx, 

dx, 

dt 

dt 

dx. 

Since the determinant 

u 

dx, dr 

dx, 

dr 


dx,/dr dx,/dr 
dxt/dt dxt/dt 


is zero only when x,-x ,«0 and then vanishes like r, we see that dl/dx 7 and 
dr/dx-i are at most infinite like a finite power of r _l as x 2 —*0 or as x # —*0. Let 
this power be m so that we have that dt/dx* and dr/dx 7 are 0 (r“") where 
m is some finite positive number. 

Now return to the first differential equation of (45) and note that in 
addition to the facts used in the discussion there made T X2 and Tu are 
0(£ 2 *x,«) for arbitrary positive integers p t q. On integrating this equation 
and using the inequalities just established we obtain the result that 


V 


dr 



exp cxp [-/ o (m 4 +u>r„)d/J 

r dx 7 dx t 1 I 

’LdT Wri * + ~d7° iT ' i \ d ^1 “ T ^l 0 ^)N»ogr| = | 0(r f ) 
for an arbitrary positive integer r. 
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Out of this discussion and the equation (58) follows immediately the fact 
that df/dx 7 = 0 (r r )= 0 (x^x a «) where r, p and q are arbitrary positive inte¬ 
gers. It is evident by induction, using the equations (40), (59) and (60), 
that corresponding statements are true for all the partial derivatives of 
/(*i, x,) with respect to x t and x, and hence that all these partial derivatives 
are zero for x 2 = 0 or for x, = 0. 

Although the function /(x,, x a ) which we have just been discussing has 
been defined only for the quadrant of R for which x, and x t are positive, it 
is readily seen that it can be defined in the other quadrants in a similar man¬ 
ner and will have the same properties with respect to these quadrants as 
it has with respect to the first one. When these four functions are joined along 
the axes they unite to form a function which is of class C* in a neighborhood 
of the origin, is analytic for Xa^O, x,j*0, and is zero together with all its 
partial derivatives when Xa = 0or when Xi~0. This completes the proof of 
Lemma 3. 

V. First non-analytic normalization 

Thus we have found two invariant surfaces other than x*»0. For pur¬ 
poses of notation let the equations of these invariant surfaces be 

*i - xi xffi(xt , xt) - F(x t , xj), 
x t = x?xtgi(x x ,x t ) =G(x i,x$). 

Reviewing the properties of F and G, we know that the function F(x*, x*) 
is of classC" in a neighborhood |x* |, |x» | <r, t r, > 0 of x, - x, - 0 and is analytic 
at any point (x*, x a ) of this neighborhood for which x*?*0, x a ^0; the function 
G(x i, x 3 ) is of class C* in a certain neighborhood, |xi|, |*a|<r ff , r,>0, of 
Xi = X| = 0 and is analytic at a point (xi, x 3 ) of this neighborhood for which 
xi j* 0, x 3 ^ 0. Let r 0 be the smaller of r, and r 0 . 

The aim of this section is to use these invariant surfaces in a transfor¬ 
mation which will transform the differential equations (30) into ones of the 
same form in which the (?i = (?i = 0. Now the invariant surfaces (61) fail to 
be analytic along the axes and we wish to set up the equations of a transforma¬ 
tion which will carry these surfaces into coordinate planes, and will be ana¬ 
lytic except possibly for those points which lie on these surfaces. To this 
end let us examine the transformation 

xi = yi +F[xt - G(ji f y») t y»], 

(62) x t = yt + G[x i - F(y*, y») ,y«], 

x$ - yt, 

and confine our attention in the x< space to a region R, about the origin for 
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which \x i \<r 0 , and in the y t space to a region R , about the origin for which 
\ yi | <r 0 . Furthermore, let R, and R y be so chosen that the transformation 

(62) establishes a one-to-one correspondence between the points of R . and 
those of R y , which clearly can be done so that the origin (xx, Xa, x t ) = (0, 0, 0) 
is interior to R, and the origin (y u y*, y») = (0, 0, 0), which happens to 
correspond to the origin in the x 4 space, is interior to Ry. From the proper¬ 
ties of F(x a, x a ) and of G(x Xt x a ) it follows that when yi = 0, Xx = F(xa, x a ), and 
when y a =0, x, = G(xi, x a ). Moreover, this transformation (62) is of class C° 
and can fail to be analytic only when yi = 0, y a — 0, or y» =0. 

On solving the equations (62) for x, we obtain 

(63) x x = Myi,yt,y*)* ** = Myi,y*,y*), *» = >•» 

where the functions /< are of class C~ in R v and are analytic at any point 
of Ry for which yx^O, yu*0, y a ?*0. Furthermore, in any one of the surfaces 
y,= 0, y a =0, or y a = 0, the functions /< are of class C" in the variables of the 
surface, analytic at any point which does not lie on either of the axes in the 
surface, and are analytic along either axis in the variable of the axis in a 
neighborhood of the origin. This statement is obvious for y a -0. For yi = 0, 
Xx-F(y s , yj) and Xt = y*. On account of the properties of the F(xt, x,) the 
statement is seen to be true when yi -0. In a similar manner it can be shown 
to be true when y% — 0. 

Since the /< are of class C" in the Ry we may formally expand these 
functions as power series in y a or as power series in y x and yi together. Let 
these expansions be written as 

x<~a <0 (yi,y«) + or«(yi,yt)y« + • • • 

■ crioo(yi) + «<io(yi)yi + of»oi(yi)yi + • • • , 

* = 1,2. We shall now prove 

Lemma 4. The a«(y,, y0 of (64) are analytic in y lt y, for \y x |, |y a | <r 0 , 
and thectijkiyt) are analytic in ytfor |y a | <r 0 . 

Since F(x j, x*) and G(xi, x*) each contains x>* as a factor, it is seen that 
<XiQ=y<, * = 1, 2. We shall denote the partial derivative of F with respect 
to its first argument by F x , and with respect to its second by F t . Further¬ 
more, when the arguments of F are y t , y« we shall write it merely as F, 
but when the arguments are Xj— G(y x , y a ), y a we shall write it as F*. Corre¬ 
sponding notations will be adopted for G and its partial derivatives. 

Now differentiate the first two equations of (62) partially with respect 
to y a . Hence 
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whence 

(65) 


dx x /dx* \ dx, /dxi \ 

— -Pt+F?l— -c\ -— = Gt + G*(-- Ft\ 

dy, \dy, / dy, \dy, / 


—(1 - FfG. n = Ft + PtGt - PtGfPt - FfG,, 
dy. 


—(1 - FtGt) = Gt + GtFt - GtFtGt - «P*. 

dy. 


Now for y, = 0, F\G\ and the right hand sides of the equations (65) are zero 
since F and G both contain xj* as a factor. Hence dx\/dy, and dx*/dy, are 
both zero for y, = 0. 

Now recall the fact that F, as well as all its partial derivatives, is analytic 
in x , for \x% \ <r 0 , x, = 0, and that G and its partial derivatives have corre¬ 
sponding properties. Hence any partial derivative of any order of F* or G* 
with respect to the arguments Xt — G, y, t and x» — F t y, t respectively, is 
analytic in y, and y lt respectively, for \y t | <r 0 , ya-0, since yi = Xi and y, = Xt 
foryi=aO. The nth order partial derivatives-of F* and G* with respect to 
y, are polynomials in the partial derivatives of x x and x, with respect to y, 
up to and including those of the nth order. The coefficients in these poly¬ 
nomials are polynomials in F*, G*, and their partial derivatives with respect 
to their respective arguments. Hence, from the equations (65) we may 
determine seriatim all the partial derivatives of x x and x, with respect to 
at yi« 0 as functions of y x , y, which are analytic for \y x |, \y t \ <r 0 . 

Now differentiate the first two equations of (62) partially with respect 
to y x and y%. We obtain on simplification 


£*[1- 

-PfGf ) - 1 -PtGi, 

a*. a*, 

- a Gi -; 

dy x 


dy x 

dyx 

~ [l m 

-PtGt) - 1 -GfPi, 

£fi = p ,t f*. 

dy i 


dy. 

dy. 


Now when y x = y, = 0, xi = x, = 0. Hence, by using an argument similar 
to that used in connection with the partial derivatives with respect to y, t 
it is seen that from (66) we may determine seriatim all the partial derivatives 
of Xx and of x, with respect to y x and y, at y x =y, = 0 as functions of y, which 
are analytic for | y, | <r 0 . This completes the proof of the lemma. 

From the equations (65) and (66) we can now prove that all the partial 
derivatives of x x and x,, when y x , y ,, or y, is zero, are of class C® in the re¬ 
maining variables and are analytic in these variables provided neither is 
zero. We shall prove this only for the dxx/dy, for y x = 0 because an induction 
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proof can be made for all the other derivatives except those of the first order 
and a proof can be given for them along the lines of that to follow. As we 
have already seen, if yi = 0, then x, = y,, x, = y«, and x x = F(y t , y»). Hence the 
coefficient of dx x /dy t in the first equation of (65) is 1 for y»==0, and it is 
readily verified that the right hand member of this equation is of class C" 
in yt and y,, and is analytic in these variables for y t * 0, y a ^0. 

Now let us perform the transformation of variables defined by the equa¬ 
tions (62) or the equations (63). The differential equations (30) will be 
transformed into differential equations of the same form in y< since under the 
transformation (62) the xi = x, = 0 axis and the x,-0 plane are invariant. 
From the definition of yi = 0 it follows that when an integral curve has at 
any point y x - 0, it has y x - 0 for all its points in the neighborhood of the origin 
under consideration. A similar statement is true concerning y t . Hence the 
right hand members of the first two equations in our new system have y x 
and y t , respectively, as factors. Thus the new equations have the form 


(67) 


* yi( m i + y»Ui)> 
at 

* 7 - ■ y«(**« + y*^*)» 

at 

-j- " ?•(**• + y\V • + ytUt) , 
at 


in which the U t are functions of the y f which are of class C" in a neighbor¬ 
hood of the origin and are analytic at any point of this neighborhood for 
which y/9*0 t j — \, 2, 3. Furthermore, the U t and all their partial derivatives 
have the property that in any one of the coordinate planes y t —0,j — 1, 2, 3, 
they are of class C~ in a neighborhood of the origin, analytic at any point 
of this neighborhood which does not lie on the coordinate axes of this plane, 
and analytic along any one of these coordinate axes in the variable of the 
axis in a neighborhood of the origin. All these statements follow at once from 
the properties of the transformation (63), because on differentiating the 
equations of (63) we obtain 


( 68 ) 


dx x dfi 

dyi 

4 . 

•Vi 

dyi 


dy. 

dt 

dyi 

dt 

i 

dyi 

dt 


dt 

dx t dft 

dy x 

4_ 

d/t 

dyi 


dyi 

dt 

dy x 

dt 

I 

dyi 

~dt 

dy$ 

dt 

dx t dft 

dy x 

_L 

df* 

dyi 


dyi 

dt 

dyi 

dt 

1 

dyi 

dt 

dy% 

dt 


xi(mi -f xiPi) + xtxtQ x , 
xj(m* + XiPt) + xfxfQi, 
x»(«| + X X Pi + XtPJ . 
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Now replace the x, in these equations by the functions of y, as given in (63). 
Since all the functions of y,- which now appear in these equations ( 68 ) as 
well as all theij partial derivatives have the same properties of continuity 
and of analyticity that the /< themselves have, we see at once that our state¬ 
ment concerning the Ui and their partial derivatives is true. 

VI. Second non-analytic normalization 

The aim of this section is to reduce the system of differential equations 
(67) to an equivalent system of the same form for which the formal series 
for the <tn will consist in the linear terms only. To this end we shall first 
prove 

Lemma 5. Consider the formal series 

x !+•••+ - x x m xf x? + • • • 

m\n\p\ 

(69) — *i + a x (xi,xt)x% + • • ■ 4- a p (x lt x t )xtr + ••• 
"*»+•••+ a mn (x t )x l m xf + • • • 

in which the only linear term is x Xf every other term contains x x x% or x^x* as a 
factor , a p are analytic in x lf Xj for |x» |, |x* | <r, r > 0 , and a mn are analytic 
in Xifor |x« | <r. Into the series (69) substitute the formal series 

(70) Xi~ yi + 0u(yi,yt)x> + • • • + 0<*(yi,yt)*i* + • • • 

“ y* + -h 0im*(xi)yryf + • • • , i - 1,2, 

in which the only linear term in the ith series is y<, every other term contains 
y&t or y t x x as a factor , the fl ip are analytic in yi, y t for \y x |, |y*| < r and 
Ptm* are analytic in x%for |x 3 1 <r, and arrange the resulting series as 

( 71 ) *~Yo(yi,y*) H-+ y P (yuy%)x? H- 

= 7oo(*a) +-h 7-«(xj)yry," + • • • . 

Then y { are analytic in y lt y t for |y» | F |y*|<r and y mn are analytic in x t 
for |x 3 1 <r. 

The truth of this lemma is almost obvious. From the properties of the 
series (69) and (70) it follows at once that yo-y x and that 700 = 0. It readily 
follows that 7i=0u(yi, y*)+ori(yi, y a ) which is obviously analytic for |yi|, 
|y 2 1 <r. Seriatim, we may prove that all the y p are analytic for |y x |, |y 2 1 <r, 
and in a similar manner we may prove that all the y mn are analytic for 

M<r. 

Now consider the effect of the transformation (62) on the formal series 
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<£,. From Lemmas 4 and 5 it follows that the series fa when expanded in 
powers of y, have the property that when they are arranged as power series 
in y 3 the coefficients are analytic in y x and y 3 for \y x |, |y 3 1 <r 0 , and when 
arranged as power series in y x and y i the coefficients are analytic in y 3 for 
\ yi 1 <r 0 . But these series in y, obtained by substituting the series of (64) 
for the Xj in the series for the fa are the same as the formal series solutions 
of the partial differential equations 


(72) 


— yi(mi + y 3 Ui) + y*(tnt + y%Ut) 

dy x dy t 

dfa 


+ —?•(»*« + yiU* + ytU 4 ) = mrft, 
dyt 


since the equations (72) are the transforms of the equations (31) under the 
transformation (62). From the form of the equations (72) it follows that the 
formal series for 0, has y< as a factor for x = l, 2, 3, respectively. Hence 


we may write 

(73) ~ y«(i + • • • + *"* ’- .yrytyt +■■■') (* — 1,2,3). 

\ minipi / 


in which the series within the brackets have the same properties relative to 
the convergence of sets of terms as the series for the 0< themselves have. 
Furthermore, the series for fa and fa reduce to y x and y 3 , respectively, when 
y,- 0, and the series for fa reduces to y 3 when yi**yi = 0. This follows im¬ 
mediately from the form of the partial differential equations (72). 

Now “fit” the series within the parentheses appearing in the equations 
(73) by means of three functions Fi(y x , y,, y>) in the manner described in 
Lemma 2, leaving the constant terms as they are. Hence we have the trans¬ 
formation 


(74) Zi * y<(l + Fi(y x ,y t ,y 3 )) (i = 1,2,3), 

defined in which the have the properties relative to the <t>> which are pecul¬ 

iar to their mode of definition. 

We wish now to determine the type of the system of differential equations 
which the z, satisfy when the y, in the equations (74) are replaced by an arbi¬ 
trary solution y,(0 of the differential equations (67). On account of the pro¬ 
perties of the power series fa which were mentioned in a preceding para¬ 
graph, we see that the transformation (74) has the property that if y x =ys = 0, 
then Zi =Z 2 = 0 and z 3 =yi, and if y 3 = 0, then z 3 = 0, Zi =yi and z 2 = yj- Hence 
the form of the differential equations which the z, satisfy is the same as the 
form of the differential equations which the y< satisfy, since also y< = 0 
implies that z<«=0 for i = 1, 2, 3. 
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On remembering that 

dti dzi dzi dzi 

— = -— yi(mi + y»Ui) + — yi(tn t + y x U j) + — yi(m> + y\U% + y« U4), 
dt dyi dy% dy t 

and on recalling the definition of the z y we see that 


dti 

dt 


Ri 


1,2,3), 


where the are functions of the y / which have the same properties as have 
the Fi in a neighborhood of the origin and are zero together with all their 
partial derivatives for yi = y t = 0 or for y* = 0. From these properties of the 
Ri and those of the transformation (74) it follows that the functions R t and 
all their partial derivatives with respect to the Z/ are zero when Zi = Zi=0 or 
when z, = 0. 

From this argument follows 


Lemma 6. Under the transformation (74) the system of differential equations 
(67) is equivalent to a system of the type 


(75) 


dzi 

~dt 

dz t 

dt 

dz% 

dt 


ti(mi + z x Ti), 


zt(m x + z%Tt), 


z$(m t + ZiT 9 + ZtTi ), 


where T t are functions of z it are of Class C" in a neighborhood of z x = z t = z, = 0, 
are analytic for Zi^O, Zt^O, z a ^0, are analytic together with all their partial 
derivatives in z it z f for z k =0, *V/, and are zero together with all their 

partial derivatives when z x = z x = 0 or when z> = 0. 

It may be noted here and will be found of use in a later section that the 
properties of the T t imply that each is 0[(zi* +Za s ) p Za«) for any positive 
integers p, q. 


VII. Final reduction 

In this section we shall make the final reduction, i.e., we shall find a trans¬ 
formation which will reduce the equations (75) to the form (8). The finding 
of such a transformation will be effected when a certain set of three functions 
is determined. Since all three of these functions can be determined in exactly 
the same way, we shall limit ourselves to the determination of the first. 
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In fact we shall prove the following lemma by showing only how to obtain 
the first of the functions g »: 

Lemma 7. There exists a transformation of the type 

«• - s<gi(f&»s*,ss) (i = 1,2,3), 

where the functions gi(z Xt z,, z>) are 0 / class C* in a neighborhood of the origin 
and are analytic for Zi^O, z,^0, z,^0, wmkr wAicA /A* differential equations 

(75) are equivalent to the differential equations 

dzT 

— = mz? (i « 1,2,3). 

dt 

Furthermore , «m&r /Aw transformation the plane z* = 0 corresponds to the plane 
Zi*- 0 /or i—l, 2, 3. 

The discussion will be relative to a closed region 5 of the z< space which 
contains the origin as an interior point and in which z? +zf < 1 ,zf < l,m x +z t T x 
and mt+ZiTt are greater than zero and m,+zir,+z,r« is less than zero. It 
will be further assumed that 5 is composed only of points which lie on integral 
curves of the differential equations of the system (75) which intersect the cone 

(76) *i* + " °- 

It is readily seen that this hypothesis concerning S is compatible with the 
others, since, according to the others, the slopes dz x /dz t and dz^/dzi of any 
integral curve in 5 are zero or infinite only for those integral curves lying 
in the z*-0 plane or in the Zi = zt-0 axis. This implies that the integral 
curves of the differential equations (75) passing through an arbitrary point 
in a sufficiently small neighborhood of the origin will intersect the cone (76) 
in S. On account of the signs of the left hand members of the equations 
(75) in S, it is seen that every integral curve in S with the exception of those 
lying in the z 5 = 0 plane or in the zi — zt = 0 axis intersects the cone in only one 
point. It will be found convenient to define So as S with the plane z« = 0 and 
the axis Zi *= Zi = 0 removed. 

If we write the equations of the cone (76) in the parametric form 

(77) zi — ri, zt = T t , z« = ± (rx* + r^) l/t , 

and ch r*>e these equations as the equations giving the initial values of the 
Zi when / = 0, we see that the points in S 0 are placed in a one-to-one corre¬ 
spondence with the points in the corresponding n, t 2 , / space by means of 
the equations of the integral curves of the differential equations (75), 

(78) z. = z.(r„r,,/) (»= 1,2,3). 
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where the functions z. are of class C* in their arguments for (z,, z,, z a ) in 5 0> 
and are analytic if ti^O, t 3 ?* 0 . 

Let us introduce the following notation relative to the region S: 

Mi = max (mi + Za7'i), Mt = max (m 3 + Zj7j) , M% = max (m a + z x T% + Z2T4) , 

(79) 

N 1 = min (mi + z t T x ), Nt = min (m 2 + zaTj), = min (m a + Z\T* 4* Z 2 T 4 ). 


Let Mo be the larger of M x and M t , and let N 0 be the smaller of N x and N t . 

For this region we wish to find how the t of any point of S 0 varies with 
the r{ +t} of this point. From the first two equations of (75) it follows that 

d(zi* + zj*) _ 2zi a (mi + z$Ti) 4- 2z a a (m a + z»T 3 ) ^ 

Zi* + *i* + 2** 

Hence 


log (it + it) - log (r,« + T t) - 


2zi > (m l + ZjT\) 4- 2za*(m a + z 3 T 3 ) 
zi' + z 2 * 


dt. 


Since z, a +Za a is less than 1 in S, log(zi a +z, a ) is negative. Consider that portion 
of So which corresponds to /£0. For this portion zi a +z* a is greater than 
Ti* 4T 2 a . Hence t ^ — [log(ri a 4r, a )) [l/(2A7 0 )] since N 0 is less than the in¬ 
tegrand of the integral in the above equation. 

By means of a similar argument with respect to the third equation of 
(75) for t£0 we can show that — |log(r E a + r 2 a )] [1/(2A7,)]. If we let N 
be the smaller of 2N 0 and |2A7, | we see that 

(80) M S - [log(r.* +r,«)][l/W], 

where (r lf r a> t) corresponds to the point (z >f Zj, z a ) in 5 0 . 

Now the first equation of (75) is transformed into one of the same form 
by any transformation of the type 


(81) z,* = zigi(zi,z*,z>), Zj* = z it za* = z, t 


where gi(0, 0, 0) is 1. This follows from the fact that this transformation 
leaves the Zi=0 plane invariant. Thus we wish to determine the func¬ 
tion g,(zi, Za, z 3 ) so that the corresponding Zi* will satisfy the equation 


(82) dt?/dt - mizi*, 

when the z< in the equations (81) are replaced by an arbitrary solution z<(/) 
of (75). . 

On differentiating the first equation of (81) with respect to t and using 
the fact that we wish z* to satisfy the equation (82), we obtain 
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whence 


dzf 

dt 


dz x dgi 


d log gi 
dt 


— z»Ti. 


If we set/i = log gi t it is evident that, if we can find a function/i */i(zi, z*, z 3 ) 
of class C° such that 


(83) 


dh 

dt 


= - tiTi 


along every integral curve of (75) and such that /i(0, 0, 0) is zero, we have 
solved our problem because in the same way in which we found/i we can find 
functions / 2 and / 3 corresponding to z, and z 3 , respectively, in the same way 
that fi corresponds to Zi. 

Now determine the solution 

(84) /» = *i( T »» T *»0 

of the differential equation (83) which corresponds to the solution (78) of 
(75) and which has the initial value /t-0 for * = 0. On account of the fact 
that the right hand members of the differential equations (75) and (83) are 
of class C° in their arguments and are analytic if z<^0, f-1, 2, 3, the func¬ 
tion hi of (84) is evidently of class C~ in its arguments for +t 2 * and 
analytic for n*0, r,^0. Since by means of the equations (78) we can 
express n, r 2 and / as single-valued functions of z< of class C" for (z», Zt t z 3 ) 
in So which will be analytic if *<*0, i -1, 2, 3, we see that we may write the 
equation (84) as/i-/i(*i, Zt, z 3 ) where the function/i(z,, z,, z 3 ) will be defined 
only for (zi, Za, z 3 ) in S 0t and will be of class C" in its arguments for this region, 
analytic if z.^0, »-1, 2, 3. We need now only to investigate how this func¬ 
tion behaves as z 3 -*0 or as (zi, z*)-*(0, 0). We shall prove that it and all its 
partial derivatives have limits zero as z 3 -+0 or as (z t , Zi)-*(0, 0) and hence can 
be so defined that they are of class C* for (z,, z 2 , z 3 ) in a certain neighborhood 
of (0, 0, 0). 

Let us first observe that, on account of the similarity of the situation here 
to that in Section IV, it can be readily shown that proving a function 
/(zi, ^ z») to be 0[(s?+*/)’**'] is equivalent to proving it 0[(r, J +r 2 J )'] 
where p , q and r are arbitrary positive integers. 

Now we are in a position to discuss the behavior of /i(zj, z*, z 3 ). It follows 
from the equation (83) and from the inequality (80) that 
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| /, | = I f ‘o[W + z})W']dt £ I 0[(r,* + tJY] | • I log (r,» + T,») | ■ l/N 


for an arbitrary positive integer r, since T x = 0[(zi s +zf)’z t iq ] for arbitrary 
positive integers p, q. Hence /i(z», Zj, z a ) =0[(z^for arbitrary 
positive integers 

By the method used in Section IV for the function /(x,, x a ) it can be 
shown that every partial derivative of f x {z x , z,, z a ) is also 0[(z? +«/)’£»**] 
for arbitrary positive integers />, 7 , since all the partial derivatives dz x /dr x , 
dri/dzi, dzi/dt, etc., are at most infinite like a finite power of (t^+t/)” 1 . 
Hence the function /i(zi, Zi, z a ) is of class C* in a neighborhood of the origin 
and therefore, on account of its definition, the function g x (z x , z,, z a ) of (81) has 
the same property. Since the analytic properties of the J x also imply that the 
g x has corresponding analytic properties, and since the other functions g t 
and gi can be determined in the same way in which the g x has been determined 
and will have corresponding properties of continuity and of analyticity, we 
see that the proof of Lemma 7 is complete. 

We may summarize the discussion of this and the preceding sections in 
the concluding 


Theorem 2. There exists a transformation of the type 

(85) x<=xM t t?,z?) (t- 1,2,3), 

where the functions x<(z,*, z** f z a *) are of class C" in J some neighborhood 
(*■*, zf, zf) - ( 0 , 0 , 0 ) and are analytic for z,* ^ 0 , z,* 5 * 0 , z a * ^ 0 , under which the 
differential equations (7) are equivalent to the differential equations 

dzT 

<* - 1,2,3), 


in a certain neighborhood of the origin, which point remains invariant under the 
transformation. Furthermore, when the equations (85) are solved for the zf as 
functions of the x f and are expanded as formal power series in the x f , if the 
equations (7) have the form (12) the formal power series for zf will formally 
satisfy the partial differential equation (13), and the formal power series for 
zf and zf will formally satisfy the analogues of (13) in fa and fa. 

Hahvabd University, 

Cambridoe, Mass. 
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NOTE ON LINEAR DIFFERENCE AND DIFFERENTIAL 

EQUATIONS 

By George D. Birkhoff 

Department op Mathematics. Harvard University 
Communicated December 13, 1940 

The modern analytic theory of ordinary linear difference equations had 
its inception in an early paper of Henri Poincar4 published in 1885. More 
than two decades later the work of N. E. Norlund, R. D. Carmichael and 
myself led to what, in a certain limited sense, is a '‘general” theory. More 
recently the existence of a full complement of formal solutions has been 
established in all cases, and on this basis W. J. Trjitzinsky and I have given 
jointly a truly general theory. 1 

Much before this latest work, the very clear conviction had come to me 
that the analytic theory of linear difference equations provided a com¬ 
plete methodological pattern in the formulation of a comparable general 
theory for the essentially simpler but analogous field of linear differential 
equations. This point of view was altogether substantiated by Trjitzinsky 
in his subsequent development of the general theory of linear differential 
equations.’ 

What I desire to point out in the present Note is that the methodological 
relationship between the two fields is so close that the theory of linear 
differential equations falls out as an immediate corollary of the theory of 
linear difference equations. For the sake of simplicity, however, I shall es¬ 
tablish this fact only for the typical case of a linear differential system (in. 
matrix notation) 

Y'(x) - A(x)Y{x) (1) 

with an irregular singular point of rank 0 at x = «, so that the nth order 
square matrix A{x) s (fly(x)) is analytic at x = A graduate student 
at Harvard University, Mr. E. C. Gras, is undertaking to carry out system¬ 
atically the extension of this program to the general case. 

In the first place there is an immediately provable theorem* which allows 
us to conclude that any matrix solution Y(x) of (1) satisfies a functional 
equation 

YMx)) = M(x)Y(x) 

where M(x) is analytic at x = ®, provided that y>(x) is a function of the 
form (case q = 0 of the theorem) 


<p(x) = x + l + — -f 
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For our purposes here it suffices merely to take 

¥>(*) — * +. 1 

and thus arrive at the difference equation 

Y(x + 1) = M(x)Y(x) (2) 

where M(x) is analytic at x = <=. 

Now in accordance with the general theory of the difference equation (2), 
there will exist a formal matrix solution S(x) which has the form 

S(x) - (pjWsijix)) (3) 

in case the characteristic equation 

\p6ij - m v (<»)| = 0 

has n distinct roots p x , ... p„; here the expressions fy(x) stand for de¬ 
scending formal power series in 1/x such that the determinant of the lead¬ 
ing coefficients |$y( ® )| i s not zero. We confine attention to this case. 4 

But the general theory in this non-singular case 5 shows that there will 
then be two special "principal solutions” K+(x) and Y-(x) such that the 
asymptotic formulae hold: 

Y+(x) ~ S(x), - ^ - c £ arg x g ^ -f €, 

Y-(x) ~ 5(x), ^ - « £ arg x £ ~ + «• 

The most general solution of (2) may be written as Yo(x)P(x) where Ko(*) 
is any particular matrix solution and P(x) is a periodic matrix of period 1. 
Hence any solution K(.r) of (1) may be expressed in the first one of the 
specified sectors as 

Y(x) = Y+(x)P(x) (5) 

Here P(x) is not only periodic of period 1 but analytic in the finite plane 
with |P(r)| 4= 0, since | Y{x)\ 4= 0 and | K+(x)| 4= 0, for |x| sufficiently large 
in the right half plane. Substituting this expression in (1) we readily obtain 

K + -i(*)[K + '(x) - A(x)Y + (x)) = P'(x)P-'(x), (6) 

which leads at once to an asymptotic relationship in this sector 

P\x)P~'(x) ~ (Ov/p,)V%(*)). (7) 

The matrix P’(x)P~ x (x) appearing here on the left is of course analytic 
in the finite plane and periodic of period 1. The expressions ty(x) appear- 
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ing on the right stand for descending formal power series in \/x. The pre¬ 
cise meaning of the asymptotic relationship written is obvious. 

But if we transfer attention to the z-plane where 

z = e uix , 

this relationship shows that the typical element q^z) of the matrix on the 
left in ( 7 ) is not only single-valued and analytic for z 0 , ®, but is of 
finite order at these excepted points. Thus q is (z) must be polynomials in 
1 /z and z. The same relationship also shows this to be impossible unless 
t tj (x) m 0 for all i and j. It follows therefore from (7) that S(x) is a formal 
matrix solution of the given differential equation ( 1 ), and that we may put 
tij(x) sOin (7): 

P'(x)P-'(x) ~ <0>,/,.)‘(0)). (8) 

However it is then clear that the product <fo(z)go(l/z) is not only ana¬ 
lytic for z =1= 0, ®, but vanishes at these points. Hence the matrix P'(x)- 
p-*(x) and so P'(x) vanish identically. Consequently P(x ) is a constant 
matrix, of constants C; and, by ( 6 ), K+(x) is a solution of the given dif¬ 
ferential equation (1), while F(x) = Y+(x)C is the given solution of (1). 

It is thus immediately established that the given linear differential sys¬ 
tem (1) admits of matrix solutions K(x) asymptotic to S(x) in the two 
specified sectors covering the entire plane. From this follow at once all the 
usual facts about the asymptotic behavior of solutions in the complete 
neighborhood of x = ®. 

There can be no doubt that the intensification of this type of argument, 
based on the general theory of linear difference equations, will lead easily 
and directly to the asymptotic characterization of the solutions of linear 
differential equations at any regular or irregular singular point. Further¬ 
more it is equally clear that in a similar manner the formal theory and the 
inverse Riemannian theory for differential equations follow directly from 
the corresponding results for difference equations. 

1 G. D. Birkhoff, "Formal Theory of Irregular Linear Difference Equations," Acta 
Mathcmatica, 54 (1930); G. D. Birkhoff and W. J. Trjitzinsky, "Analytic Theory of 
Singular Difference Equations,” ibid., 60, 1-89 (1936). 

* W. J. Trjitzinsky, "Analytic Theory of Linear Differential Equations," ibid., 62, 
167-226 (1938). 

* G. D. Birkhoff, "Theorem Concerning the Singular Points of Ordinary Differential 
Equations,” these Proceedings, 1, 578-581 (1915). 

4 If this restriction is not satisfied, more complicated types of series and a more com¬ 
plicated analytic situation may arise, but the general course of the argument is not 
affected. 

‘See, for instance, my paper, "General Theory of Linear Difference Equations," 
Trans. Am. Math. Soc., 10, 430-470 (1909). 
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GENERAL THEORY OF LINEAR DIFFERENCE EQUATIONS' 

BY 

GEORGE D. BIRKHOFF 
• 

Introduction. 

The theory of linear difference equations with rational coefficients was in a 
very backward state until PoiNCARfcf in 1882 developed the notion of asymp¬ 
totic representation, and its application to this branch of mathematics. Further 
important progress in this direction has been made only very recently, notably 
by Galbrun,| who by means of the Laplace transformation has investigated 
the properties of a set of fundamental solutions in the entire plane of the com- 
plex variable x. 

The aim of the present paper is first to study the nature of these solutions to 
which I have been led by means of direct methods, and secondly to show that 
there exists a purely Riemannian theory of linear difference equations. In par¬ 
ticular certain ratioual functions of are shown to play a part like that 

of the monodromic group constants of an ordinary linear differential equation. 

To the best of my knowledge, the importance of the functional standpoint in 
the field of difference equations was emphasized first by Van Vleck in an 
inspiring series of lectures given at the University of Wisconsin in the spring 
of 1909, in which he conjectured the existence of sets of solutions analytic on 
either the left or the right side of the complex plane. 

On account of the extreme simplicity of the matrix notation, I have found it 
convenient to deal with a linear difference system of n equations of the first 
order 

(1) ?,(■* + I )=Z %(*)?>(*). «= 1,2, 

rather than with a single equation of the nth order. In these equations the 
functions a (> (x) are taken to be rational functions of x with a pole at x = oo of 
order \i at most, so that one has 

(2) a j/( x ) — a tfX" + <x*-' + •••, (l*|>R). 

* Presented to the Society, September 6, 1910. 

t American Journal of Mathematics, vol. 7 (1885), pp. 203-258. 

JComptes Rendus, vol. 148 (1909), pp. 905-907. 
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The sole other restriction to be imposed on these functions below will, in par¬ 
ticular, be satisfied if the deterininantal characteristic equation 

(3) K-V/l = ° 

[where S tJ = 0, i #= j; S u = 1] has n distinct roots which are not zero. If the 
n sets of functions 

9n( x )' 9.i( x ) 


are n linearly independent solutions of (1), the array of elements g v (x) forms a 
matrix solution G(x). The array of elements a^x) forms a second matrix 
A(x) y and the n* equations which the n solutions satisfy can be combined into 
a single matrix equation 

(4) G(x+l)=A(x)G(x). 

If x be replaced by x — 1, this may be written 

(4)' G(x- 1)=* A~ l (x — 1 )G(x). 

The most general matrix solution of (4) is furnished by 

(6) ' H(x)-G(x)P(x), 

in which G(x) is any particular matrix solution, and P(x) is any arbitrary 
matrix of periodic functions of period 1 whose determinant does not vanish 
identically. 

The equation (4) admits of two symbolic solutions 

f (?(x) = A(x— 1 ) A(x — 2) • • •, 

|{r(x) = A~ x (x)A~ x (x + 1) •••, 

which converge to limit matrices only in particular cases. This convergence is 
secured to a sufficient exteut by a suitable modification, made in § 1, as follows: 
The system (1) is known to be satisfied formally by taking for g x (x)% &(*)» 
* * ’» 9n(x) certain formal series, one set of them for each of the n roots p of the 
characteristic equation (3). The matrix formed from these sets taken in a cer¬ 
tain order is denoted by S(x ), and the modified matrix obtained by breaking 
off these elements at the &th term is called T(x). The sequences of matrices 

A(x — \ )A(x— 2) • • • A (x — m)T(x— m), 

A~ x (x) A~ l (x + 1) • • • A~ x (x + m — 1 )T(x -f m), 

form the desired modification of the two symbolic solutions. It is demonstrated 
that all the determinants formed from the first A. columns and any \ rows of the 
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first of these matrices, (X = 1,2, • ••, n), or from the last X columns and any 
X rows of the second of these matrices, converge to limit functions independent 
of k as m becomes infinite. By the aid of these functions and by a process of 
summation based on a contour integral suggested by the one which Carmichael* 
has used, two remarkable particular matrix solutions G (x) and //(x) are 
obtained in §§ 2, 3, 4. These are termed the first and second principal matrix 
solutions respectively. The elements of G (x) are analytic, save for poles at 
points <r + 1, <r + 2, •••, where a is a pole of one of the elements of A (x), 
and have the asymptotic form of the corresponding elements of .9(x) in any 
left half plane. The elements of //(x) are analytic save for j>oles at points 
a — 1, <r — 2, where <r is a pole of one of the elements of A~' (x — 1), and 
have the asymptotic form of the corresponding elements of S(x) in any right 
half plane. The principal matrix solutions G (x) and //(x) are uniquely deter¬ 
mined by this asymptotic property. A relation (5) will also, of course, subsist 
between G(x) and //(x). 

Conversely if G(x) and //(x) are asymptotically represented by the elements 
of a matrix S(x) even to terms of the first order and are related by means of 
a matrix P(x) of periodic functions, as in (5). these matrices G(x) and //(x) 
will form the first and second principal matrix solutions of a system (4) in which 
the elements of A (x) are rational (see § 7). 

It is proved in § 5 that the elements of P(x) are rational functions of 
The structure of these functions is determined with particular reference to the 
case when the functions a ; (x) are polynomials of degree /*, to which case the 
more general one is reduced. 

These results lead in §6 to the determination of the nature of the elements 
of G(x) and //(x) in the remaining right and left half plane respectively; the 
asymptotic form is found to alter along certain critical rays. A complete 
description of the principal matrix solutions is thus obtained. 

In § 7 it is shown that the number of characteristic constants involved in the 
complete characterization of G(x) and //(x) and their connection by means 
of P(x)is precisely equal to the number of arbitrary constants in the coefficients 
a t/ (x) assumed to be polynomials, and that the characteristic constants are inde¬ 
pendent. Thus I have ventured to propose the following as a fundamental 
problem of linear difference equations: to determine whether there exists a linear 
difference system with prescribed characteristic constants. 

It would be interesting to know the exact relation between the characteristic 
constants and the monodromic group constants of the differential equation yielded 
by the Laplace transformation. 

The method contained in the present paper is of wide generality and applies 
if the coefficients a„(x) have the character of rational functions only at x = oo, 

•These Transactions, vol. 12 (1911), pp 99-134. 
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in all cases where there exist » formal solutions. Carmichael (loc. cit.) h» 
studied this case to a considerable extent; in particular he has proved the exig¬ 
ence of analytic solutions and has partly discussed their asymptot.c form ... 
direction of the axis of reals. 

§ 1. The Determinant Limits. 

It is apparent that the given linear system (1), when written in the matrix 
notation (4), (4)\ possesses the two symbolic solutions 

A(x-l)A(x-2) •••, A-'(x)A~'(x + 1 ) •••• 

These will converge to limit matrices in certain special cases, for example if 
«<,(*) haa tbe form 

in this case the above symbolic solutions will yield the complete solution of (1). 
In general however these expressions must be properly modified in order to be 
made to lead indirectly to a complete solution of (1). 

Let us first make precise the limitation which is to be made on the coefficients 
a (*) in equations (1). If the roots of the characteristic equation (3) are dis¬ 
tinct, and if none of them are zero, there will exist precisely n sets of series 

*, ,(*)=*"(*«- )'*"{*«+*' + ••• |’ 

+ “x +•••}’ <•/ = >. 


( 6 ) 


Xx)~x-‘( Pl e-y* | •••). 


each of which constitutes a formal solution of (1). The proof of this is a matter 
of direct reckoning, and tbe fact will be taken for granted.* The constants 
Pr Pi' • • • i P n are the of the characteristic equation, and furthermore we 
have the determinant 

(T) __ + 

* The formal solutions of a single linear difference equation of the nth order have been fre¬ 
quently used (for example see Horn, Crelle’s Journal, vol. 138(1910), pp. 159-191) and 
admit of easy extension to a linear system. The formal solutions may be obtaioed by direct 
substitution if tbe roots of the characteristic equation are distinct and different from zero ; tbe 
series expansion 

( 1+ I)— 

enables one to make a comparison of coefficients and thus to determine the constants of the 
formal solutions. 
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The restrictive hypothesis which we shall make is not that p t , p t , • • •, p H are 
distinct but only that there exist n formal solutions (6) with the specified prop¬ 
erty (7). The quantities p,, p J% • • • , p A must of course still be the roots of the 
characteristic equation, and none of them can be zero. We shall choose our sub- 
scripts so that 

(8) Ia|S|aI» — s|aI- 

Without loss of real generality we may add the further condition that the 
linear homogeneous difference equation of the nth order at most which g x (x) 
satisfies, and which is obtained from (1) by elimination, is actually of the nth 
order. 

The elements s IJ (x) form a matrix S(x) which is a formal matrix solution of 
the equation (4) 

S(x + l)=A(x)S(x). 

Let us compute the elements s' it (x) of the inverse matrix £“'(x). The deter¬ 
minant | £(x)| may be written 

*•■*(/>,p. •••*>.«-■'...J. 

The element in the ith row andyth column of the inverse matrix is the quotient 
of the cofactor of the element in the yth row and the ith column of this deter¬ 
minant by the determinant itself, and is therefore given by a formal series 

(9) «;,(x)=*-**(#>,«-* )-*-'• {^+• • • J • 

Let T(x) denote any matrix obtained by breaking off the elements of S(x) 
at the &th terra, or more generally by replacing a tf (x) by convergent series £ v (x) 
of the same form as s i; (x) and having the same first k terms. Write then 

(10) T(x+l) = B(x)T(x). 

The matrix Z?(x), as thus defined, is clearly a matrix of functions 6 tf (x) each 
of which has an expansion like a^(x) atz = co with precisely the same first k 
terms. For if we compare the formulas 

B(x)= T(x + l)T-'(x) and A(x)= S(x + 1) S~\x), 

we observe first that the element ^(x) of T~ x (x) is given by (9) to the first 
k terms, and that in consequence the right hand members of both the equations 
have the same first k terms. In consequence we may put 

(U) A(x) = B(x) [1 + xr k C(x)] , 

1 being the identity matrix, and the elements of C(x) being analytic at x = oo . 
It is to be noted that the determinant formed from the principal coefficients of 

Tr*o«. Am. Mtth. Soc. II 
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A (x) and *(»). namely |a,|, cannot be zero, for that implies that zero is a 

root of the characteristic equation. . - 

In this paper it is necessary to distinguish sharply between two kinds o 

asymptotic representation of a function by a series of the form (6), say 

Let ? ( x) be the given function; if for each k the difference 

/ *'> s“»\ 

g(x)x—{pe- )-*- “ [• + T + •' ’ + J? J 

becomes uniformly small of order ar* for lim * = on in a certain region, we say 
that g(x) is asymptotically represented by t{x) in that region, vith respect to 
x; if on the other hand the difference is uniformly small of order tr* (where 
X= :U + \Z~--lVy u, v being real quantities) for lim v = oo in the region, we 
say that the representation is with respect to v. In this connection two 
important inequalities are 

<■» 

for x in the left half of the complex plane, and 

2 


1 2 [ 1 rrl 


^ 3 ) VI- - \ . .. . 1 -, + A I (* = 2 )' 

for x in the right half of the complex plane. To prove the first of these we note 
that when x lies in the left half plane u is negative, so that 

|x - k|* - (u - «•)* + »• > U* + «* + S, 

whence the sum of the given series is less than 

S(|*l’+'7ji* < r pp (l«P+ *’*)" *T 

Again, for x in the right half plane, our series may be written 

- 1 ~ 1 1 ^ 1 

If each quantity |u — v\ be replaced by the last of the integers 0,1,2, • • • 
which does not exceed it, the terms in the last series are not decreased, and hence 
one has 


nm 


1 


£ 


M*-* I* 


k-2 


1 2 fl f d* 1 
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But the last integral reduces to tt/ 2|u|, and thus the second inequality holds 
also. 


We may now proceed to the existence theorem: 
Theorem I. Form the two sequences of matrices 


P(x)=A(x-\)A(x-2)-.-A(x-m)T(x-m), 

(14) (m —l, 2, •••)• 

Q m (x)= A~ x (x)A ! (x-f 1 ) • • • ./l \x + m — \)T(x+m) t 


Every determinant formed from the. first X columns (X = 1,2, • • •. n) and the 
ith.jth , ..Ith rows of -P,( *)[&»(*)] converges, for k sufficiently large , to a 
definite limit function [v w ...,(x)] » independent of k , as m becomes 

infinite. This function is analytic throughout the entire x-plane excepting at 
points any number of units to the right (left) of poles of the elements of 
A (x) [ A~ l (x — 1)] and at these points may have a pole. The asymptotic 
form of w 0 ...,(x) [tty...*(*)] in the vicinity of x = oo is given by the corre¬ 
sponding determinant «„...,( x) °f $( x )* witfl r ^pect to x in any left (right) 
half of the plane , and with respect to v in any right (left) half of the plane. 

Proof. We shall first prove the theorem in so far as it relates to the case 
X = 1, and then show how the cases X = 2, 3, • • •, n may be reduced to this 


It is necessary first to show that the elements of the first column of P m (x) 
converge to definite limit functions w,(x), w a (x), • •., u n (x) as m becomes 
infinite. 

The matrix P m (x) may be written 
p m (x)-nx)p m ( X ) 

= r(x)![r- l (x)^(x-i)r(x-i)][r- l (x-i)^(x-2)r(x-2)] 

... [T~'(x- m + \ )A(x — m) T(x — m)\). 

It is sufficient to prove that the elements of the first colurau of P m (x) will con¬ 
verge, inasmuch as the elements of the first column of P m (x) arise from com¬ 
bination of the rows of T(x) with the first column of P m (x). The matrix 
P„(x) is a product of matrices, the type of each of which is 

t-'(x + i)A(x)T(x). 

By (10) and (11) this reduces to 

l + ± T~'(x)C(x) T(x). 

Let us examine the second term. The element in the ith row and Jth column 
of T~ l (x) will be given by an expression like that for s[ y (x) in (9) with the 
same first k terms; the elements of C(x) are analytic at x = oc , the element 
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in the ith row and jth column of T(x) will be g.ven by an express,™ l.ke that 
for s (X) and with the same first k terms. The element ,n the uh row andyth 
column of the product T~'(x) C(x) T(x) has therefore the form 

(?/)’ af.-n times a function analytic at x = oo. 

Choose first k so large that it exceeds every difference r, - r. in absolute value 
by more than an assigned integer </52, and we see that the second term may 

be written 

(15) J*0(x) with elements **(*)-(^)+«(*)• 

Tho function ^(x) is finite in the vicinity of x = co, at least if x be restricted 
not to pass the positive axis of_ reals. 

By means of this notation, P m (x) reduces to the form 

('+f^ry 8 <" 1 >)(‘ + C^? 81 *- 2 ’)'' ( 1+ ' 


or more fully, 

(i6) 4 <4i . 

+ £ -*.>«(*-*.>+•••• 

It is the elements of the first column of this matrix which must be shown to 
converge. According to the notation of (15), the ith element is 


But <p,j(x) is limited in the vicinity of x = oo and is less than some constant 
M for'l x\> R r Therefore if all the points x - 1, x - 2, • •. are exterior to 
the circle |x| = i?,, which is true if x lies in D of fig. 1, the typical element in 



the (/ + 1 )th term of (17), 


Fio. 1. 
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1 1 _ _ 1 (PrY'* ( pA"" (PlY'" 

(x - k t y (x - k 2 y'" (x - k l y\p i ) \p r J \pJ 

x <M*- *.)*M*- K)- • • - K)i 

is less in absolute value than the quantity 

(p. y 

\p,J (*-*■)*• ••(*-*■)* 

This inequality depends on the fact that we have 




!(£)■(£ mr 



as a consequence of (8), and the fact that &, , k t , • • •, k m are an increasing set of 
positive integers. Hence each term of (17) does not exceed the corresponding 
term of 


w + |(*y 


r M (P,Y\‘f' 

X I* — I' \pj |x-i,|'|x- ^1' 


in absolute value. 

Allow m to approach infinity. The expression (17) becomes a multiple series. 
Since the sum of the terms after the first in (18) approaches the limit 



it is obvious that the elements of the first column of P m (x) and therefore of 
P m ( x ) converge absolutely and uniformly to an analytic function in the vicinity 
of points in D. Moreover as we may write 


A(*-l)A(*-2)... A(x-r)P m _Xx-r), 

and as we may take r so large that x — r — 1, x — r — 2, • • • lie without the 
circle |x| = i?, , the elements of the first column of P m (x) will converge uni¬ 
formly in the vicinity of any point of the plane excepting those only at which 
one of the elements of A(x— 1 ), A(x — 2), ••• has a pole. Hence the ith 
element of the first column of P m (x) converges to a function u t .(x) which is 
analytic except at the points congruent to (any number of units to right or left of) 
the poles of the elements of A(x) on the right. At these points u.(x) may 
have a pole. 

It may easily be made evident that the definition of w,(x), w 2 (x), • • •» u n (x) 
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is independent of the choice of * and T. For if k' is any second choice of k 
and T(x) is the corresponding function T(x ), we may nevertheless write 

/>'(*)« T(x)l[T-'(x)A{x-l)T(x-l)] 

If the elements of the first column be expanded in a sum like (17) and m be 
allowed to become infinite, the resulting multiple series for u.(x) will be terra 
for terra identical with that obtained before. 

It remains to discuss the asymptotic nature of t/,(x), •••»"„(*) for large 
values of Ixl. The limit u 4 (x) of the ith element of the first column of P m (x) 
differs from 8,, by a quantity less than (19) in absolute value if x lies in D. 
This quantity is itself less than 



in absolute value. If now any line be drawn paraUel to the axis of imaginaries 
and if x be restricted to lie in D to the left of the line, by inequality (12) the 
last factor will be of the form M(x)jx'-\ where M(x) is limited. Thus we 
may write 

for x in D and to the left of this line. Likewise if x is in D to the right of 
such a line, by inequality (13) we may .write 

But the relation P„(x) = T(x)P m (x) gives at once 


T"l 

If in this equation the first of the above expressions foBU,(x), t/,(x), • • •, m„(x) 
be substituted, we find at once 

The functions t tJ (x) are the same in form as «^(x) up to the kth term. Substi¬ 
tuting in their expansions we obtain 

«,(*) = I (*« + X + • • • ) + £ * l?=r (■'<• + ¥ + " • )) • 

The second terra in brackets can clearly be made infinitesimal in x of as high 
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order as may be desired by taking &, and therefore d , large enough. Consequently 
u.(x) is represented asymptotically by a 4 l (x) with respect to x to the left of 
any such line and likewise with respect to u to the right of such a line. The 
part of the theorem that relates to the case X = 1 is proved. 

It is clear that if the subscripts 1, 2, • • n had not been so chosen that (8) 
was true, the only modification would be that the elements of any ith column of 
P m (x) for which the quantity p { had a maximum absolute value would 
converge. 

The case X = 2 will next be discussed. This reduces at once to the case 
X = 1 inasmuch as the two-rowed determinants uJjJ(*) formed from the ele¬ 
ments of the first two columns of P m) (x) which are in the ath and 0th rows 
(a C0) are related to a certain difference system precisely as the elements of 
the first column are related to the original system (1). 

In fact, form the system with general solution 


( 20 ) 


9n\x) 


g'S*) 9%( x ) 

9A*) *J(®> 

Here g\ (x), g\(x)> • • g m (x) and g[(x) % g" t (x ), 
solutions of (1). One finds by the use of (1) 


= 2. «». 


» 9n ( x ) are a °y P air ol 


(21) ^ = (* + 1)“ 

where 

“«:».(*) “ »»(*)«,.(*) - “«(*)<»»(*)• 


Accordingly we have Jn(n — 1) 
type (1) to determine the functions 
expansions 

!**.(*) »«(*) 




V*) */■(*) 


linear difference equations of precisely the 
g,j . (x). It is clear that, the sets of formal 


= & mM {p k p,Y*'* n ( ( « a s Jt - $ a s M ) + terms in x ~ x , x" 2 , • • • }, 


k and l fixed, will satisfy (21). There are Jn(n - 1) sets of these, and further¬ 
more the determinant of the constant terms 


9 u*Ji 0 

is precisely equal to d*~ x and is therefore not zero by (7).* Hence the system 
(21) satisfies the restrictive hypothesis made at the beginning of § 1. 

We are now in a position to apply the result already obtained. In matrix 
notation, equations (21) may be expressed as 

_ G t {x + 1 ) = A t (x)G t (x). 

* Pascal : Die Determinanlen, p. 87. 
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Also let Tj(x) be the matrix which the formal matrix solution S. : ( x ) becomes 
when the formal series are replaced by convergent series which agree with them 
to the £th term. The elements in that column of 

= Afx - l)A.(x - 2) - - ■ - m)Tfx - m) 

for which the quantity p L p, in the formal solution which appears in the same 
column of &(x) has a maximum absolute value will converge ; this results 
from an application of the preceding result for X = 1 to the system of )»(n— 1) 
equations. Equations (8) show that we may take k = 1, 1=2 and it is also 
clear that we may take the elements of this column of T t (x) to be 


W*) 


*.,(*) 

(*(*) 



W*> ’ 


The elements of this column of /^(x) converge to functions analytic 

everywhere except for poles at )>oints to the right of the poles of the elements of 
A t (x) and congruent to them. Moreover these functions are represented 
asymptotically by s U: lt (x) throughout the entire plane, with respect to * to the 
left of any line parallel to the axis of imaginaries, and with respect to v to the 
right of such a line. 

But the sequence w = 1 »2, ... is identical with the sequence of 

determinants formed from the elements of the first two columns which 

are in the ith and jth rows of P m (x). In fact the first member of both 
sequences is t IJ:tz (x), and in general /#&'(*) is derived from />£/,*(*) just 
as i/y +,> (x) is derived from wj^x), namely we have 


/>;7m5 , (*)= £«*.*»(* - 1 )/"*?!«(* -1 )• 

i: I 


E «.,(*- 1 Z "<,(* -1 -1) 

r-l rmt 

± «„(*- l)// r 7t*-l). t "A* -1 )K5(* -1) 

T =1 T=l 

= 1 ) , *3u > (*— !)• 

4.1 

Thus the theorem is true for X = 2 also. Strictly speaking, we have only 
obtained functions u iJ (x) for i < j; but we have obviously such functions for 
all i and j ; these may be obtained by means of the equation u.. (x) = — u Jt (x). 
In like manner the theorem can be demonstrated for the remaining values of X. 

The existence of the functions ,(x) and their properties maybe estab¬ 
lished in the same way. The role of right and left in the x-plane will be inter¬ 
changed and the equation (4)' replaces (4). 






487 



. 1911] 


LINEAR DIFFERENCE EQUATIONS 


255 


The functions ,(x) and r Vm , ml (x) will be called the determinant limits 
on account of their origin. 

In the case that the restrictive hypothesis is satisfied and all the roots p of 
the characteristic equation are equal, Theorem I supplies a complete existence 
theorem, every column of elements of P m (x) and Q m (x) converging to an ana¬ 
lytic solution of (1). These conditions are met if one has 

a*?’ o'-*’ 

»<;(*)= S ‘J + -£ + i? + ■■■• 

In general however Theorem I supplies only one solution directly, and that is 
furnished by 

?.(*) - "■(*)• ?.(*) - “.(*) . ?.(*) - «.(*)• 

This set of functions must be a solution since the definition of w 4 (x) gives us 

n 

w 4 (x + l) = lim £ a lr (*)...a r ,(x-m)< | |(x-m) 

r-l 

§ 2. The Solutions Associated with the Determinant Limits. 

The functions u,(x), t/. y (x), ••• which have been obtained behave as 
though they were constituent determinants of a matrix of solutions of (2). 

Theorem II. There exist n solutions j 7 t/ (x), g tJ (x), • • •» 9 n ,( x ) 

A J; (x), •••. A, y (x)](jea 1,2, • • •» n) of (1) over any discrete set of values 
congruent to a given value r 0 of x, such that every \-rowed determinant formed 
from the frst [/as*] \-columns of the matrix £(x)[//(x)] and the ith, jth, 

• • •, Ith rows will equal «<,...,(*) [fy. ..,(*)] *Acse points. The frst solu - 
lion g„(x), g„(x), • • •. ?.,(*)[*,.(*). **(*)• •••,*..(*)] is uniquely 
determined , the second solution « 7 „(x), ^„(x), •••» < 7 „,(x) 

^ 2 ,n~i( x )* is determined up to an additive term obtained by 

multiplying all the corresponding elements of the first solution by an arbitrary 
function of period 1, and so on, any one of these solutions being determined 
up to an additive term linear in the corresponding elements of the preceding 
solutions with arbitrary multipliers of period 1. 

Proof. The theorem is true in so far as it relates to X = 1 since w,(x), 
Mj(x), ••■»«„(*) has been seen to give a solution of (1). We may write then 

?..(*) = “.(*)• ?■(*) = «.(*). •••>?..(*) = “.(*)• 

Next we shall demonstrate that there exists some solution < 7 i,(^)* • • •* 

g„ 2 (x) such that for all i and j 
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( 22 ) 

For 

means of the identity 


««(*)• 


?«(*) 

For this purpose we shall recur to the sequence of elements of PJx). By 


*£<*> jC(*)| 


o, 


I/5?(») /£’(*) 

we infer, by developing according to the elements of the third column and letting 
m become infinite, that 


««(*)“»(*)-“«(*)“>(*)+ (i.i, t = 1,2. ••.»)• 

The n quantities jy,,(*„), <7 a (*„), • • •. ?.,(*.) uniquely determine a solution of 
(1) on the set of points congruent to x g . Choose these n quantities so that (2-) 
is satisfied for some fixed i such that J 7 n (ar 0 ) 4= 0» au( * * or ever y i* ^ ^ ie 
rest of the equations (22) will then be satisfied, as is evident from the pre¬ 
ceding equation. This gives us 


!?<,(*) ?<*(*) 


&,(*)- 


?>.(*) M x )' 

for x = .t„, which reduces to 




?*.(*) ?«(*) 




»«(*)" 


|sr>.(*) $*(*) 
&.(*) &*(*) 


for x = s 0 . Hence g iJt (x) and u (J (x) are equal at i = x 0 for all i and j. 
But they are both constituent elements of a solution of (21), and therefore will 
agree at all congruent points, since only one solution of that equation exists on 
the discrete set of points congruent to x 0 whose elements have given values at 
this point. It is necessary to make an exception of points congruent to the 
poles of A (x) or yl~‘(x — 1), and such points are excluded. 

The method by which this second solution was obtained clearly allows us to 
add to each element g, t (x) a terra of the form p{x)g il (x) % where p(x) is of 
period 1. 

One may now in an entirely analogous way define a solution < 7 13 (x), g^ (a*), 
. • g n j(x) on the set of points congruent to x 0 . The identities to be employed 
in this case are 


«<>*(*)*.(*) - * w .( *)«*(*) + «.«(*)“,(*) - «>*(*)*.(*) = °- 
The function g a (x) is only determined up to an additive term 

!»(*)&,(*)+ $(*)?*(*)• 
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By proceeding thus the theorem may be proved in a succession of steps in so far 
as it relates to the elements g tj (x). 

The part of the theorem that relates to the elemeuts h tJ (x) is proved in an 
entirely similar way. 

The solutions g XJ (x) % •••»?„,(*) [*„(*), • • * * (»)] ma y be said to be 

the solutions associated with the determinant limits u ,(*)» u fJ (x) • • • [»,(*)♦ 

»</(*)•••]• 

It is important to obtain explicit formulas for these functions ( x ), and 
i dc shall show that it is possible to obtain the solutions f 7 ly (x), 0 v (x), •• •, 
g nJ (x) for j = 1 , 2 , . ..,n in terms of u,(x), w <y (x), •••» a,l< l 

operation £ defined bp the condition 

Z +(0-S*(0-*(»)- 

I ml-l Img 

The functions g xl (x) % •••. ^(x) are equal respectively to ti,(x), •••, «,(x). 
It will accordingly be sufficient to obtain formulas for ^ u (x), • • •» in 

terms of the solutions g XJ (x) % •••, g^ (x)(j = 1 » 2 , •••, k — 1 ) and the 
determinayt limits. 

Form the determinant 


n 

H 

<*> 

co' 

©1 

'-r" 

»..(*) 

?..(*+ *) 

?,:(* + !) 

• • • • 

••• 9,A*) 

... ?,,(>:+ 1) 

• • • • • 


j7 n (* + i-l) 

9,A* + * ~ 1 ) 

... 0„(* + *-l)i 


If now in place of the elements g X} (x + m) (m > 0) we insert their value as 
obtained from (1) 

w 

(24 )?,,(* + »»)= Z a„(*+m-l)a r .(x + m- 2 )...a,,(*)< 7 ,,(a:), 

in terms of the functions g XJ (x), • • •» g nJ (*)» we see that the above determinant 
may be expressed as a sum of terras each of which contains as factor a deter¬ 
minant which reduces to one of the determinant limits of order k. In fact we 
have obviously 

(25) **>(*)- Z {«„(^)[Z"„(*+l)a,.(*)]---j«,....(*)• 

Hence this determinant is to be regarded as a known function. 

But the identity (23) may also be looked upon as a non-horaogeneous linear 
difference equation of order k — 1 in * 7 u (x), and may be written 

(20) »‘- , ‘(*)'/..(*+*-»)++-(*)ff,.( ; '+ i - 2 H"-±«“- , ’(*+l)?i.(*)=^ , (*)- 


The associated reduced homogeneous equation obtained by equating the left 
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hand member to zero lias the solutions [see (23)] ?„(*), • • * • 0i.*-«(*)» w,uch 
are linearly independent unless $—"(x) is zero. By a method analogous to that 
of variation of parameters for ordinary linear differential equations, let us put 


t- 


(27) 


?„(*) = E ?„(*)(,(*) 


and determine /,(*), !,(.*), • • •. *.-,(*) b - v < he e 'l" ations 

'£?„(* + l)A/(*) = °. 


>-» 

l- 


(28) 


£s , u (*+ 2 ) A/ /( J: )“ 0 ' 




x - I 

+ *“ 1 = # 4 - * ’(a;) * 


where as usual A <f> denotes the difference <f>(x + l) - <f>(x). There are here 
£ _ 1 linear equations to determine the k — 1 unknowns A/,(*),•••, A/ t _, (x), 
and their determinant is precisely 0*-"(x + 1). Unless this determinant is 
identically zero, we have accordingly 


A/ tm\ 


in which m } t _,(x) is the cofactor of the element in thejth column and last row 
of the determinant ^*" n (x +1): 

(29) ± 5 r ,i(*+ 2 ) fl p ij-»(*+ 2 ) 0 »j*i(*+ 2 ) y*.*-»(* + 2 ) 

.. ‘I 

1) 0u-i(*+*“l)0u*i( a ‘+*~l) *•* ?i.*-i(*+1) i 

The resulting expression for y u (x) is 


(30) 


“ JL9v\ x ) L + i)‘ 


It remains to show that J 7 u (x) as thus defined is a solution of (26). We have 


;=• 

i-i 


?„(* +1) = Z [(»i>( a: + 1 ) ? >(*) + fl'i,( :c + 1 ) A, >( :c )] 


(31) 


■!Zy I y(* + 1 ) , ;(»). 
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?1 ,(.- + A-i) = 'f [(?„(> + *- !>(,(*> + a ,,(* + *-1)4//*)] 

' ‘ «-l ff k, tx) 

Hence, if we multiply the last of these equations by the one which 

precedes it by ^r(x) aiul so on, and add corresponding members, the left hand 
member of the resulting equation will be precisely the left hand member of (26). 
Also all the terms except the last of the right hand member reduce to zero by 
reason of the fact that g„(x), • • •, ?,.*_,(*) are solutions of the reduced equa¬ 
tion, and there remains only a term 0“(x). Therefore g u (x) is the desired 
solution : furthermore the formula (30) does not determine j/ u (x) uniquely but 
only up to a sum of terms 

Pi(*)<ju( *) + ••• + ;>*-. (*).? «.*-i(*)* 
where />,(*), • • •, ;> t _,(x) are arbitrary periodic multipliers since the operation 
* s determined only up to an additive periodic term. But the result stated in 
Theorem II shows that g u {x) has precisely this degree of indeterminateness, and 
hence the most general determination of g u (x) is furnished by (30) unless 
0 k -"(x) is identically zero, and this is an obvious case of failure. 

When a choice of g n (x) has been made, the remaining functions • • •, 

g hl (x) which make up the £th solution are determined. In fact the equations (24) 
for tn = 1,2, • • •, n form n linear equations in g x {x), • • •, ?„(*) of determinant 
not zero, since in the contrary case we should obtain a linear homogeneous relation 
between g x (x + 1), • • • ♦ g x (x + n), which contradicts the condition placed on 
r/,(.r) at the outset that it satisfies an equation of the nth order. By solving 
these equations we obtain 

(32) g,(x) -*„(*)?,(*+l) + <r a (*)j' 1 (* + 2) + ... +*,.(*)?,(*+») 

(f=l, 2, •••,».), 

where the functions <t,j(x) are rational in x. These equations determine 

!/;(*)♦ ’ * US X ) in lerms of 0x( x )- 

That none of functions ^’’(x), •••, 0 K, {x) can be identically zero maybe 
proved as follows: In the first place 0 4, (x) has a definite asymptotic form in 
the entire x-plane, namely that obtained by replacing tt Tr ...(*) ty «,»...(*) in 
the right hand member of (25). This is a consequence of Theorem I. But this 
form is 

«„(*) •“ *n( x ) 

*!.(*+!) ••• *«( I + 1 ) 

| Sll (* + *-l) *,.(*+*-1) 
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as may be shown readily by .neons of (23). liecalling now the explicit form of 
the series «„.(*), we obtain 

(33) ~ ••(A'"")'*”' 

Since d‘(x) is represented asymptotically by this expression with respect to x 
to the left of any line parallel to the axis of imaginaries, and with respect to ti to 
the .Hit of such a line (Theorem I), it is obvious that unless rf„ <*,'>,••• are all 
zero 9 n (x) does not vanish .at a great distance from the positive axis of reals. 
But’these constants can not all vanish. Otherwise the above determinant 
vanishes (formally) and *„(*).*„(*) *te linearly dependent, and accord¬ 
ingly, by (32), *„(*),<■•.*»(*) (' “ I - 2 .")• are linearly dependent, m 

contradiction to (7). Thus (80) is a valid formula. 



§3. The Intermediate Associated Solutions. 

The application of the formulas (30) and (32) leads us immediately to a set 
of intermediate associated solutions 

?;*(*)• •••*£*(*) (i=i, 2,.••,*.), 

analytic above and below lines A cc and D co parallel to the axis of reals , and 
which have the property that 

9\k( x )**»(*)' *.*(*) 

with respect to x in any left half plane and with respect to v in any right half 
plane. 

We take 

?;,(*) - *.(*)• ••• * £.(*)=*■(*)• 

and this first solution has the properties stated, by Theorem 1. We now define 
the solutions ?;.(*), •••* 9 mt (x)i ‘ in succession by 

choosing the summation operation 2 on <£(x) to be defined as 

-*(*)-<*>(* + !)- 


and employing formulas (30) and (32). It will suffice to show that a solu¬ 
tion tf XJ [x) % • • •, f,(x) f° r j = £ "ill exist and have these properties under 
the hypothesis that the same is true for j = 1,2, • • •, k — 1. The formula 
(30), which serves to define « 7 ' lt (x) if |p*_,| >|f>*|» is then 


4-1 




We have written mj t _,(x) to stand for what m J t _ t (x) becomes when g' {J 
replaces g. r We shall first prove that the k — 1 series which appear here are 
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convergent. To this end we need to compute the asymptotic form of the typical 
term of the above series 

***(*)■£ *-*(*) 

1)’ 

and this is done by merely substituting the known asymptotic forms for the 
d'n as given in (33) and at the same time the known form of the elements 
m'j . This latter [see (29)] is clearly given by 

*-,(*)- r • • • (pj-t ^ yow)' 


as a result of our hypothesis. Combining (35) and (83) we find for the above 
term the asymptotic formula 


(35) 




m 


<ii 


(36) 


***(*) tn j. *-i (*) 

^*- , )(x)^ 4 - , >(*+ i) 



The integer is zero unless d t _ t = 0. 

Since we have |pj > \p k \ for j < k except when |p 4 ._,| = |p t | [see (8)], the 
terms in (34) clearly diminish in approximately geometrical ratio for v large 
enough, and in consequence the series converge uniformly in the vicinity of such 
points. Thus g\ k (x) has been actually defined and is analytic above and below 
suitable lines Aoo and Boo . 

By the aid of relations (36) we may now determine the asymptotic form of 
each of the series in (34). In the first place, if each term of the jth series is 
replaced by its asymptotic formula, the series becomes 


S(S Y’ {x + + ^ (”> + 4", + • ■ • 

The error committed in the final bracketed series by breaking off at the Zth 
term is of order (r -f- v)~' if x lies to the left of the axis of imaginaries, and of 
order v‘ if x lies to the right of this line, since the imaginary coefficient v of 
x ■+■ v is the same as the imaginary coefficient of x. These bracketed series are, 
of course, finite above and below suitable lines -loo and Boo. 

Now separate the series into two parts: first the sum of the first m terms in 
which v < 4 |:r|, secondly, the remaining terms. 

In this first set of terms we have 

(* + *)' = .r*(l+^ + 


where the error committed by breaking off the series at the XrtH term is of the 
order (v/x) k . Hence the first m terms of the above series may be written 
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where Mj(x) is limited. Now the terms involving M.(x) are of the order a?-*, 


since 


As m becomes large we have also 

f^\Pj) Y pj-p * 

o all powers of .r since the difference in absolute value is 


constant. 


i —— 


i-* 


Hence the first m terms have the asymptotic form 

(37) (g) W |^ + 3 S- ,+ -| 

witli respect to .r for * to the left of the axis of imaginaries, or similarly to the 
left of any parallel line. The terms after the mth diminish in approximately 
geometrical ratio for |v| sufficiently large, and all contain a very small multiplier 
( PklPj )"• H ence second group of terms does not affect the above asymptotic 
form. 

Substituting the above expression in (34) and at the same time the known 
asymptotic formulas a u (x) for g' Xj (x)% we find 

-/;.(*) - E *-(*>, «• y * {«„ + ?+•••) 

or multiplying out, 

(38) ?;,(*) ~ <* «-* y ** |f*+ ** + ••• | 

with respect to x in any left half plane. 

By means of (32) we now obtain the asymptotic formulas for 
g' nk (x). Thus we obtain n formal expansions which must of course be the con¬ 
stituent elements of some formal solution. The possibility that f lt , £,**» • • • are 
all zero must be excluded since from that fact we should infer that all the coeffi¬ 
cients in the asymptotic expansion of » 7 ' 4 (x)(i = 1,2, •••,«), and consequently 
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of &°(x) y were zero. The formal solution thus obtained must coincide with 
«■*(*)» * * *♦ «.*(*) at least Pi* * * •» P. are distinct. For it is apparent that the 
only possible difference lies in a constant factor. By definition of x) in terms 
of ?„(*) = Uni*)* • • •* 9 xk = <j \*(*)♦ and >n virtue of the fact that the asymp¬ 
totic form of 0 l, (x) has been shown to be that obtained by replacing these func¬ 
tions by «„(*), ...» a, t (a-), it follows that this factor must be 1. Hence we 
have in this case £ {k = ft 1 ; = <?» • • •• Under our restriction, these equa¬ 
tions are algebraic identities persisting in the exceptional cases when />,, • • •» p n 
are not distinct also. 

Thus we have demonstrated the existence of a *th solution having the desired 
properties provided that |/> t _,| and |pj are not equal. 

We shall now indicate briefly the modifications that are necessary in the case 
\p k _ x \ = |pj. Suppose that p k is one of a group of roots 

P*+ 1 » •••* Pb 

of equal absolute values. If p t = p., we have seen how to construct the Icth 
solution; if p k = p a>| , one may invert the order of p. and p.+, in (8) and thus 
construct a {k + 1 )th solution possessing the requisite asymptotic properties, and 
so continue to the /3th solution. It is however necessary to know that the solu¬ 
tions thus constructed are associated with « 4 (ar), t/, y (*), • • • as stated. Accord¬ 
ing to the method used in §2 there certainly exist ath, (a + 1 )th, • • •, £th solu¬ 
tions over congruent sets of points which satisfy ull the limit determinant 
relations, obtained by interchanging the order of p«, • • •, p fl in (8); for the alge¬ 
braic condition that this hold will be satisfied at all the approximations since 
these are actual determinants, and hence will be satisfied in the limit also. On 
this account at least certain determinations of the sum formula employed will 
give a &th associated solution. Since however this &th associated solution is 
determined up to a linear combination with periodic coefficients of the solutions 
before the ath, and the formula which gives it indicates a solution determined up 
to a linear combination with periodic coefficients of solutions before the ath, the 
general solution given by the sum formula is a &th associated solution. Thus 
the solutions, obtained in this exceptional case as indicated, are associated with 
the determinant limits. 

It would be of considerable interest to determine the nature of these inter¬ 
mediate solutions near to the axis of reals. 

Considerations of symmetry show that there exists a set of solutions A' ly (a?)» 
. • •, h' nJ (x) (j = 1,2, • ••,») having the properties stated. 

§4. The First and Second Principal Matrix Solutions. 

We shall proceed now to prove the existence of certain two remarkable 
matrix solutions G(x) and H(x) of (4), and this will be done by means of the 
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formulas of § 2 and the intermediate solutions of § 3. It is clear that in order 
to define any matrix G(x) it is merely necessary to choose properly the opera- 

tions £,=, which occur in ( 30 )’ We sha11 US€ a contour inte 6 ral t0 effect these 
summations, similar to that which Carmichael has used, writing 

4>(t)dt 


ImM "I 


*-/ 


< 

/ 




Fio. 2. 


» —1(*—O — 1 ’ 

where X is an integer and L is- a path oo A D oo like that in fig. 2. The 
contour AB\s* simple curve which passes between x and x — 1. The summa¬ 
tion function 4>(t) is supposed to be analytic 
except at points at a limited distance from 
the positive axis of reals and within the 
region *>ABoc. Furthermore <f>(t) is 
supposed to tend toward zero along lines 
A oo , B oo sufficiently distant from the axis 
of reals, and in such manner as to make the 
above integral absolutely and uniformly 
convergent in the vicinity of any x not congruent to a point of L . In 
this case the function represented by the integral will be analytic at a 
sufficient distance from the positive axis of reals, and will be independent 
of the particular lines A oo, Boo or particular curve AB chosen. If we 

form the difference <) “ £r-,*(0 ifc wiU bc 6 iven b y an intc S ral 

over ABB A A , the integrand being unchanged. But within this closed curve 
the integrand is analytic save for a pole at t * x ; and by the calculus of residues 
this difference reduces to 4>(x). Hence this is a possible determination of 

Z, XO- 

Having made this preliminary study of the operation £, -rl we now proceed 
to the proof of 

Theorem III. There exists a matrix solution G(x)[/I (.t)] of ( 4) whose 
elements have the following properties: 

(a) the functions g tJ (x) [A y (*)] satisfy the conditions 


?«(*) = «<(*)■ 


9jx( x ) ?,.(*) 




A|,„_i(*) h lm (x' 

M*)l 

(6) the functions (ar)[A /; (x)] are analytic throughout the finite plane except 
possibly for poles to the right (left) of and congruent to the poles of the 
elements of yl(*)[i4“' (x—1)] ; 

(c) in any left (right) half plane g tj (x)[A tJ (x) ] is asymptotically represented 
by s i .(x) with respect to x. 
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Proof. The proof of this theorem will be made by actually obtaining the 
solutions < 7 n (x), •••, £.,(*) 5 •••; ?,.(*) t •••« ?.„(*) in succession. The 
conditions of the theorem are evidently satisfied by taking the first solution to 
be 

//„(*) = »/,(*), • • •, ?.,(*) = */„(*)• 

It suffices therefore to exhibit a solution g lk (x) % •••* ff mt (x) which satisfies 
the conditions of the theorem under the hypothesis that the solutions g XJ (x) y 
’ * ’» Vn) () (j < k) have been constructed satisfying all of these conditions. 
This is the way in which the theorem will be demonstrated. 

The formula (30) gives the most general determination of g lt (x) which is 
possible if g u (x) is to be the first member of a ktU solution satisfying restric¬ 
tion (a). We shall consider first the case |p*_,|>|p*|. We choose the 
operation as indicated above, and obtain 


(39) *.<•) — 2 #„(«> X» 

the integer \ h will be chosen to be the least integer as great as the real part of 


(40) 


1 


-=(logf>.-l°g<>,). 

-7T r — 1 


The functions g u (x) % • • •» g ml (x) are then obtained from (32). 

The first step is to establish the fact that the summation functions in (39) 
satisfy the restriction on the summation function Now 0' k) (x) y 0 ( '"' ,? (x) 

are analytic save at points to the right of and congruent to the poles of ele¬ 
ments a tj (x) by (25). Likewise, by the initial hypothesis and by definition of 
nij ,_,(») as the cofactor of the element in the jth column and last row of 
^ , *" l, (x+ 1) [see the form (29)], the singularities of ^.*_i(*) (,;™1» 2, 
1) arc restricted in the same way. Hence the summation function 

~0*-l>(t)0<- |, (* 

will be analytic at all points sufficiently far from the positive axis of reals, pro¬ 
vided that d k ~ v (t) does not vanish except at points within a certain distance of 
that axis. This fact is an immediate consequence of the relation (33) in which 
e/ t , df, ••• have been seen not all to be zero. It remains to show that the 
function <f>(t) tends to zero in such a way that the integrals appearing in (39) 
converge as stated for all positions of the lines A oc and D oo at a sufficient 
distance from the axis of the reals. In order to demonstrate this essential fact 
we make use of the intermediate solutions obtained in § 3. By Theorem II the 
general determination of the solutions g Xj (x) y •••, ) associated with 
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the determinant limits is given by any one of them plus arbitrary periodic mul- 
tiples of the preceding ones. Hence we have 


/in g(x) = rfi(*)j*(*) + • • ‘ + !7u-i ( x )Pj-t.A x ) + 0u( x ) 


i \ 


where the functions p„(x) are periodic functions of period 1. analytic for x suffi- 
ciently distant from the axis of reals since the determinant \g 0 {x )\- 
does not then vanish. Substitute these expressions (41), for i = 1, in m Jt (0 • 
A reference to (29) makes it clear that the m it *_,(0 may be written as a sum 
of products of periodic multipliers analytic for x sufficiently distant from the 
axis of reals and of factors »<*.,(£), where signifies as before that 

which becomes when g is put in place of g. Hence the integrals will 

converge in the manner desired if the expressions 


tend to zero in exponential fashion along A oo and B *. This is true \( A oo 
and B oo are sufficiently distant from the axis of reals, in view of the asymptotic 
relationship (36) which holds with respect to u iu the right half plane. 

The function g xk (x) is therefore analytic at a sufficient distance from the 
positive axis of reals, and by (32) the same may be said of the remaining ele¬ 
ments g u (x) % • • •, ?.*(*) which institute the *th solution. Working to the 
right across the plane by the aid of the original equations one proves readily 
that the only possible singularities of the functions of this solution will be poles 
congruent to the poles of the elements of A(x) and to the right of them. 
Hence (b) is true for the Ath solution. Also (<?) holds in so far as it relates to 
this solution since every solution ? u (x), •••.&*(*) determined by (30) had 
this property. 

Statement (c) has to do with the asymptotic nature of g u (x) % • • •, &,*(*). 
In order to prove (c) let us break up each integral in (39; into two others over 
contours L x and L 2 as follows: the contour L x is a fixed contour x A , B x x 
(fig. 3). If x lies above /l, x, L. consists of a loop-circuit to x which includes 

the points x % x + 1, • • • within it but not x — 1, x — 2, - It X lies 

between A, x and B x x. L. consists of a loop which includes x, x + 1, 
x + l within it, where x + / is the last of this series of points to the left of 
A, /?,. If x lies below B, x. L, is a loop-circuit of the same nature as when 
x lies above .1, x. Any combination of paths L x and L, is clearly equivalent 
to the single path x ABx of fig. 2. If x lies on A x or B x or on a line 
congruent to a point A x B , on the left, the two integrals are undefined. 

Let us consider first the part of the integrals contributed by A,. The inte¬ 
grands are periodic functions of .*• of period 1 and therefore the integrals will 
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also be periodic since the contour L t is fixed. This part of thejth one of these 
integrals may be written as either of the products 


I *-*.*,.» -II 
'Lx 






_*_ 

t)0 k -"(t + 1 ) c i.»_ 1 * 

_ & l) (t) m Jk ^ x (t) _ dt 

4 1 ) 1 - 


It is apparent that in the first of these the second factor remains limited as v 
becomes positively infinite, inasmuch as e 2 *‘" ,x tends toward zero; likewise that 
in the second of these the second factor remains limited as v becomes negatively 
infinite. Hence the part of g xk thus obtained is 


(42) C„ 


3, i(x)e , '*“ , '- , *7„(*)+ ••• + ?, Jw,» (*). 


according as x lies above vl,oc, between ^4,oo and 7?,oo, or below /?,oo. The 
functions f u (a?), • • •» are periodic and limited. This is obviously 

true unless x is in the vicinity of a point congruent to a point of the contour of 
integration. However it is true in this case also since each of the three expres¬ 
sions above is unchanged if the contour be varied continuously; of course the 
function q ik (*) is not one and the same analytic function in the three cases. 

Next let us consider the contribution C^ yielded by the integrals taken over 
Z/ 2 . Let us suppose first that x lies above A t cc or below B x oo . In this event 
the integrals in may be evaluated as a sum of residues, each of which may 
be expressed in two ways, as indicated in the following formulas 


(43) 

or 




V » t, (x + y)w J . t _,(x-4-y) 

h L **-"(* + *)**-"(* + * + 1 ) J 

v„ f »»(«■>•»)■<-.(»+») 

I »*-"(x + + * + i 
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The first form is the obvious one derived from equation (39> If we observe 
however that we have, by definition of »q_ 


!gr..(3e + *+1)---!7 i.*-i(* + «' + 1 )J 


*-l 


, , , , ?„(* + *• + 2)+ *'+ 2 ), 
£?■/*)'”*.*-(* + *')“.i 


• • • Jl.t-lO® ) I 

and then write in place of g„ their equals as given in (41), the resulting deter¬ 
minant may be simplified because the periodic coefficients are the same at 
k + i> + 1,® + v + 2, ••• as at *. By a combination of successive columns 
the determinant reduces to a new determinant identical with the above except 
that (j replaces g, i. e., reduces to 


Sffu (*>•!.*-■(* + *’)• 

y-i 

Thus the second of the above forms is obtained except that the two signs of 
summation will appear in inverted order. It is permissible to change tins order 
if all the infinite series converge. This is certainly the case as the second form 
is precisely that which defined g\ k (x) in § 3 [see (34)]. Thus we have for * 
above A x oo or below B x » 

(44) £*,-?'.*(*)• 

When a: lies between A x oo and B x =o to the left of A x B x we use a form like 
(43). We will agree to denote C by </,*(*) for the sake of convenience, 
because then (44) holds in all cases and g' lt (x) as thus defined will have the 
asymptotic form s lk with respect to x in any left half plane , and with respect 
to v in any right half plane. In § 3 these facts have already been proved for 
f n (x) as there defined, that is for x not between A , oo and B x oo . We need 
therefore to prove that g\ k (x) -«,*(*) with respect to x between A t cc and 
ao only. The discussion is entirely similar to a discussion that has been given 
in $ 3. In the first place the integrals in the expression g' Xi (x) for C L% may be 
evaluated as a sum of residues 


, if! , »‘\x+v)m lk _ x (x + v) 

(45) 0\k- x ) - — + „)^‘-')(x + p+ 1)* 

where * -f- / is the last point of the set x % x + 1» • • • to the left of A x B x . If 
this equation be compared with (34) one sees that the right hand members differ 
in that a finite sum replaces an infinite series. Proceeding precisely as we did 
with (34) we arrive at the asymptotic relation g' xt (x) ~ *,»(&) with respect to 
x by means of the hypothesis made that g xi (x) y • • • * g»i( x ) are asymptotically 
represented by s,;(x), •••,«„(*) with respect to x in any left half plane and 
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for j = 1,2, 1. The sole modification consists in this that instead 

of having an infinite set of terras in which those after the mth (m the greatest 
integer less than $|z|) do not affect the asymptotic form, one has only a finite 
set of terras in which however those after the mth do not affect the asymptotic 
form. 

The proof of (c) of the theorem iu so far as it relates to < 7 u (x), • • 9 nk (x) 
may now be given. Iu the first place, for x above A x oo, (42) and (44) give 

*„(*)« w;r '?..(*)+ ••• + *..(*)• 

The dominating term will depend upon the dominating term of 

g<3«A U i'-i + i»r?iXx, ... f + 


which begin the formal expansions. On dividing by the last of these, the 
exponents become 


2ttv/ 


-1 (a,. 


log Pi ~ l og Ph\ r 
2 irV- 1 / ’ 


..., 27 rt/-l 



log P*-,-l°gPA 0 

27TV/-1 ) 9 


The bracketed expressions here all have a positive real part less than 1: this 
follows from the definition of X u ., • ••, X t _, 4 . and the fact that |/>*_,|> |p*|. 
Hence the coefficients of x are represented by points in the upper half plane 
below the line v = 2tt. Therefore if x lies in the second quadrant, all of 
these exponents have a negative real part, and the last term dominates, that is 


«.*(*) 

for x above vl, oo and to the left of the axis of imaginaries, or likewise of any 
parallel line. 

For x between -1, oc and D x oo we have similarly 


!/,»(*) ~*..(*)fu(*)+ + *»(*) 

and the last term clearly dominates again by (8). 

For x below B x *, «y u (x) is represented by 

s„ (x)e ! * ,1 "- ,> J ~''9u( x ) +■■■+ s, .M+«,.(4 
The dominating term depends on the exponents 



logp.-logpA i f \ | 1 «gP> r j- 1 °gPA o 

W-I +- W _i ) r -°- 


The coefficients of x will in this case clearly lie below the real axis and above the 
line v= 27t, and if x lies in the third quadrant, the last term s u (x) again 
dominates. 

Hence in any left half plane ? K (*) is asymptotically represented by s lk {x) 
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with respect to and of course ?*(*), •••, 9 mk (*) are at the same t,me 
represented by #*<*). —* *-(*) ^pectively, as follows from (32) Thus a 
solution g lk (x), •••,?,*(*) exists possessing propert.es (a), (6), (c) unless 
| p I = |p 4 |, that is unless p k is one of a group p., P. + ,, • • •» P p of roots p ot 
equal absolute value. The same argument which enabled us to treat this excep¬ 
tional case in § 3 may now be applied here; that is, we may take p* as the first 
of this group without violating (8) and construct a *th solution which satisfies 
(a ), (6), (c) corresponding to each of these values of p. 

The second set of solutions A |y (x), • • •, A b> (x), (j = 1,2, •••.») ma y be 


obtained by considerations of symmetry. 

There is one and only one solution g tk (*),-• •, &*(*) [*!*(*)• ** *• 

(k=l, 2, •••, n) analytic except for poles and asymptotically represented 
by s (x), • • • * «,*(*) even 10 term* of the first order in any left (right) half 
plane The matrix G(x)[/l(x)] with elements ?<,(*)[*<,(*)] ' s termed 
th e first ( second) principal matrix solution of (4). 

In order to justify this definition it is necessary to show that there exists but 
one such solution g u (x) t •••» £*(*)[ A,»(x), *' *» ^ .*(*)]• Suppose if 
possible that there exists a second solution • • •, &*(*) with the proper¬ 

ties stated. Now we can express g,„(x) as follows: 


&•(*)* 0ll(*>J>l(*> + **• +9u(. X )P*( X ) (« = 1.2, ",M), 
where />,(»), •••♦/>.(*) are periodic of period 1, inasmuch as g XJ (x) % •••, 
g„t (*) (j — 1 * 2, • • •, n) form a fundamental system of solutions. If we solve 
these equations for p t (x) % we shall have 

?„(*) &*(*) ••• *!.(*) 


™ ' 0n(*) ••• 9u(*) 

. 

- - - !/..(*) 

Consider now some definite period-strip formed by two lines parallel to the 
axis of imaginaries and one unit apart, and let x become infinite in this 
strip. On account of the known asymptotic form of the elements <j and 7j in 
the strip to terms of the first order we find at once, if i' #= &, 

for x at either end of the strip. If we write z = e 2 "'“‘ x , this may be written 
Pi {x) - z^^-^og z ) r *“ r '[0]. 
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The upper end of the strip corresponds to z = 0, and the lower end to z = oo. 
The above relation shows that />,(*) has a zero of at least the order X rt . at 
z = 0. Likewise, this relation shows that />-(*) has a pole of order \, k — 1 
at most at z = oo. Consequently />(x)z _A ‘* is analytic at z = 0 and vanishes 
at z = oo. Moreover, by taking x far enough to the left, the original determi¬ 
nant expression for p t {x) * s analytic since every element of the two determinants 
is analytic, and the denominator is asymptotic to the formal determinant |s. y (*)| 
which is not zero if x lies far to the left. Hence /»,•(*) for i 4 = k % is 

analytic for all finite values of z and vanishes at z = oo; it must be identically 
zero. We conclude that 

(/A x ) = 


Now let i be chosen so that s, t , the constant of the principal term of a. 4 (.r), is 
not zero, as is possible by (7). Consider any point x 0 and the points x 0 - 1, 
x 0 — 2, •.. to its left. As x takes on these values, j lk (x)/g tk (x) tends toward 
1, and thus we see that />*(*•) is 1 at the point r 0 . That is, y u (x) and y lfc (x) 
are one and the same function, in contradiction to our hypothesis. 

If the functions «<,(*) had not been restricted to be rational, the only modi- 
fication in §§ 1-4 would be that in the statement and proof of (6), Theorem III, 
other singularities than poles would enter. 


§5. The Fundamental Periodic Functions. 

Either of the two principal matrix solutions G (x) and //(.»•) of (4) may be 
expressed in terms of the other, e. g., 

(46) G(x) = II(x)P(x) % 

in which the elements p,fx) of P(x) are periodic functions of period 1 which 
may be termed the fundamental periodic functions. We shall now determine 
the nature of these functions. The fact that will enable us to do this is that 
G(x) and II (x) are composed of elements of known asymptotic form in any 
period strip which may be taken to be one and the same in both cases. 

On account of the gain in simplicity we shall make a preliminary transforma¬ 
tion. Let 

d(x) = (x — a)(x — 0) • •• (x — X) 

be the least common denominator of the set of rational functions a.fx), and 
write 

y,(x) = T(a: — a)T(x —/9) • • • T(x — X)*/,(.r) (* = 1, 2, •••, n), 

where F(x) is the gamma function. The transformed system will be altered in 
that the common denominator has been removed. The formal solutions of the 
new system will be those of the old multiplied by a certain factor, and hence 
will satisfy the restrictions imposed in § 1. 
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The new coefficients are polynomials in x. It is then permissible to take all 
the functions «,(«) as polynomials, say of degree M or less, and this we shaldo. 
The first principal matrix solution G(x) is constituted of elements analytic 
throughout the finite plane, in this case, inasmuch as the elements a„(x) have 
no poles in the finite plane. 

From the equation (46) we have 

(47) P(x) = //-•(*)£(*); 

but if we take our period strip far enough to the right, the elements of H(x) 
are analytic functions of x and the determinant of H(x) will not vanish in the 
strip inasmuch as its asymptotic form is given by the determinant K(*)l- 
Consequently the elements />„(*) are functions of * = t"*-* which are analytic 
except perhaps at s = 0 and z * oo, corresponding to the upper and lower end 
of the period strip. Iu the case when the functions a tj (x)\n (1) are merely 
rational it is clear that the periodic functions p^x) are analytic or have poles 

except at z = 0 or z * oo. > . . * 

It is necessary now to remove a certain ambiguity. The expression s..{x) 
rep.esei.ts the elements jfc ; (x) and A y (x) asymptotically in any right and left 
hand side of the plane respectively, and is a many-valued expression which is 

changed to 

(48) «„(*) 

after a complete positive circuit of x - 0 [see (6)]. Hence if the same deter¬ 
mination of ,„(x) is chosen for ?u (x) and A u (x) along the direction of the 
positive axis of i.naginaries, the elements g v (x) and A u (x) will have a different 
asymptotic form along the direction of the negative axis of imaginaries, since in 
one case we pass from the positive axis of imaginaries to the negative by a posi¬ 
tive partial circuit, and in the other by a negative partial circuit. (It must be 
remembered that g { .(x) and A 0 (x) preserve their asymptotic form in any left 
and right half plane respectively.) Let the determination for A 0 (x) be termed 
s 0 (x). Then that of g 0 (x) is obviously obtained by a complete positive circuit 
and is given by (48) in the direction of negative imaginaries. 

In the upper half of the period strip ?„(x) and h„(x) have a common 
asymptotic representation s v (x), and this proves that 

P{x)^ S-'(x)S(.v). 

But the elements of s' 0 (x) and s 0 -(x) of S~’(x) and S(x) are of the respective 
asymptotic forms 


X»'p-’X- r ‘ (T,j + + 
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(see (9) and (6)], so that in consequence the relation holds 

Therefore, if we think of PiJ (x) as a function of z, we have 

logz \ r «-> 

uJ- -V 

The single-valued analytic function PiJ (x) of z is accordingly infinite only to a 
finite order at z = 0 and either has a pole there, or is analytic. For i =j it is 
clear that P(J (x) is an entire function of z taking on the value 1 at z = 0, that is 

(49) ;>„(*) = l+ci:><r' jri> + •••• 

For i +j we have obviously 

lim Pu (x)z ~** = constant 

since was defined as the least integer as great as the real part of 

logft-logp, 
v 2 t tV—\ 

Hence PiJ (x) has a zero of order X v , at least, at z = 0, and may be written 

(50) Pv (x) = c'-^ r '‘(c'J> + c<"e'- J ’'' + •••)* 

the coefficient of the exponential factor being an entire function of z. 

It remains to consider the functions /><,(*) in the lower half of the period 
strip. Here the functions g^x) and h v {x) have the respective asymptotic 
forms 

and the matrices G(x) and H(x) are asymptotically represented by 

S(x) and £,(x) 

respectively where 

S,(x)- «(«)(^ J - i W- , 8,).' 

Hence we obtain 

P(x)~S-'(x)S,(x) or [S- , (*)5(*)](^- J 

or S, v ). 

Thus each function P ,j(x) satisfies the condition 

•The notation (0 tj ) signifies a matrix with term H tj in the ith row nndjth column. 
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We see again that ,„(*) becomes infinite only to a finite order at a = » 
This demonstrates that p. ( (x) is a polynomial in 2 and that P// (x) (i+j) 
equal to z* multiplied by a polynomial in 2 . 

In the case i =j we have 

lim Pu^z-* = <r ar ' J -\ 

so that the polynomial p u (x) is of degree p in 2 with leading coefficient . 

In the case i + j we have 

li = constant, 

*=• 

so that the polynomial coefficient of 2 ^ is of degree p-l. 

Thus we have proved the following theorem: ... 

Theorem IV. Let G(x) and II(x) be the first and second principal 
matrix solutions of (4) in which the elements of II(x) are polynomials of 
deyree p at most in x, and write 

G(X)=II{T)P(X). 

The fundamental periodic function, p„(x) which constitute P(x) are in this 
case of the form 

,»„(*) -1 + + • * - + ■ 

(51) />„(*) - [<»> + c •!)e ! ’ J - i '+ ■ ■. 0 +J). 

in which X„ denotes the least integer as great as the real part of 


j— / =f (1 °gP,- l 0 SP, ) 


The simplest illustration is given by the difference equation for the Gamma 
function, when one has 

= i, M x)-r ( x). 

A similar analysis of the nature of the functions p 0 (x) at 2 = 0 and 2 = oo 
may be made in the more general case when the functions a u (x) are not 
restricted to be rational functions, and shows that PiJ {x) is analytic or has a 
pole at 2 = 0 and 2 = 00 with leading terms of the same form as those of (51) at 
these points. If the a {J [x) are rational, it is obvious that the functions Pi ,(x) are 
rational in e uJ - u ; we shall not stop to investigate further the structure of 
these functions. 
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§6. The Principal Matrix Solutions in the Entire Plane. 

The results of the preceding paragraph make it possible to discuss the asymp¬ 
totic form of the elements of G(x) [and likewise of //(*)] in the entire com¬ 
plex plane. In the first place it has been shown that 

along any ray from x = 0 in the second or third quadrants. To investigate the 
asymptotic form along rays in the first and fourth quadrants we make use of the 
equation 

f» 

(52) G(x)-H(x)P(x) or ?<,(*)-(U-M. 

in which the elements h t) (x) are of known asymptotic form s 0 (x) in any right 
half plane, and the elements / > 0 (x) are known functions of x. 

Consider now a ray from x = 0 in the first quadrant and lying above the axis 
of reals. Along this ray, by equations (51), we shall have 

p 0 (x) — where X (l = 0, cfi> — 1 . 

Thus the preceding equation for £ 0 (x) leads to 

(53) 9<i( x ) ~ +■•■+ 

a formula which determines the asymptotic form in the first quadrant. 

To determine the dominant term we need to compare the relative magni¬ 
tude of the terms 

rtf** 3 ** — 

If these be divided by p', the logarithms become 


/ log Pl - log p ■ 
V 2tti - 1 



Or, 




By the very definition of X t> , the coefficients €,y, • • •, €y y = 0, • • •» e« y of X in these 
expressions will lie in the strip 0 ^ t < 2 tt. Let I \, - - P m be the points which 
represent these coefficients in the complex plane. On account of (8) these coef¬ 
ficients will have decreasing real parts and the points P l% • • -, P m are ordered 
from right to left. As argx increases from 0 to Itt the above logarithms main¬ 
tain their relative positions but are rotated about r = 0 through the same angle. 
The dominant term along the ray arg x = t is that one for which the corre¬ 
sponding point is farthest to the right (i. e., the logarithm has the greatest real 
part) when a rotation r has been effected. None of the terms after the J-th 
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can dominate since the corresponding points are to the left of P, = 0 for all 

values of t < tt/2. 

Draw the convex broken line P.P, • •• Pj below which none of the points 
p * ..., Pj lie and whose vertices are at some of these points. If the plane 
be'rotated through an angle r, it is clear that these vertex-points, and only 
these, will correspond to dominant terms. Let the acute angles which the 

successive sides of P, P 9 • • • Pj make with the axis of reals be <*»,, <*>„,. 

The critical rays along which the dominant term changes are then in order of 
increasing magnitude 

o — » o ” ^* » * * •• 


For small values of arg x the dominant term is clearly given by 
and as the first critical ray is passed this changes to 

Along the ray the form is given by a combination of two or more terms, a fact 
which we shall express by saying that the change is uniform along the ray. A 
similar change takes place along each of the successive critical rays until the last 
is passed when the asymptotic form becomes s 0 (x). 

Likewise in the fourth quadrant the asymptotic nature is deduced from a 
relation 

The expressions s (l (x), •••»*<„(*) which appear here are the determinations 
for the functions A <( (g), • • •, A fa (x) in the right half plane. Consequently the 
determination of s u (z) obtained by a positive rotation from the first to the fourth 
quadrant along the negative axis of reals is this determination multiplied by the 
factor . Hence in terras of the second determination 


••• +»</(*) 

After division the corresponding logarithms are 
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The coefficients of x are represented by points ft, - - - , ft = 0, • • •, ft. 2 tt 
units below the corresponding points l\ (except for P. = ft = 0 ). I hese 

points all lie in the strip — 2ir ^ v < 0 and are given in order from right to left. 
As arg x decreases from 0 to - x/2, the figure formed by these points rotates 
through an angle — tt/ 2, and the dominant terms correspond to the point farthest 
to the right. None of the terms after the jth can dominate. 

In this case we are led to the following rule: draw a convex broken line 
ft ft ...ft above which none of the points ft, ft, •••, ft lie and whose 
vertices are at these points. Let the acute angle which the successive sides 
ft ft ... ft make with the axis of j: be f,, •••• The critical rays 

along which the dominant term changes are then 


7 r , it 


in order of decreasing magnitude. For small arg* the dominant term has the 
asymptotic form 

and as the first critical ray is passed, this changes uniformly to 


and similar changes take place along the remaining critical rays until the last 
ray is passed, when the form is s, f (x). 

Theorem V. Ltt the coefficients he polynomials of tlcyrcc not 

(/renter than p in X. 1 Vrite 


V — 1 


/ log p, - log Pj 



e 


•J 


where \„ is the least integer ns great ns the real part of the negative of the 
first term in parenthesis , and where p l% p r • ••,/>„ ore the roots of the 
characteristic equation ordered according to decreasing absolute magnitude. 
Mari' the two sets of /toints 

e t j , t.j , 0 and e,j — 1, e.j — 1, •••, — 1* 0 

and call them respectively 

1\ % />„ and ft, ft, ft. 

Construct the conrcx broken line 1\ P c • • • I ) J above which all the remain¬ 
ing points P lie , and likewise construct the convex broken line ft ft ••• ft 
below which all the remaining points (£ lie. Let the acute angles which 
the successive sides of P x P 9 • • • P- and ft ft • • • ft make with the 
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axis of reals b — r “P <el * W » The C "'‘ W 
between are, x = 0 and arg x = <la are then 




77 77 

-o + V'#* -o + Y» 


2 

in «»</«/<»• or*r. 77,< asymptotic form of g,,{x) changes uniformly from 

sJx)e--et'-'-C 10 

f/,* cWf.W ray «'•? * - 1* ~ *« " nd l^msefrom 




-»>* -I' *-•> 


l*-I Wm-U 
c yj 


to 








a; 


M« critical rag arg x = - }* + Between the last critical ray of 

the first set and the first of the last set the asymptotic form is given by $ u {x). 

it is interesting to observe that with *„(*)■ • - •. U.j{*) «'*'* are a^cated 
at most j - 1 critical rays in the first and in the fourth quadrants so that, for 
example, when j - 1 there are no critical rays. But we have g x {x) = ",(*)> 
o tx )« !/.(*) and this result is in accord with Theorem 1. 

There are two special cases of exceptional interest. The first is that in 

which p r p.t • • •» P. are of e T ,al a ^ solute va,ue - Tlie q uantities •<> are m tl,,s 
case all pure imaginaries and the broken lines P\%P 9 % ••• and Q,, Q«s ••• 
reduce to segments of the imaginary axis, and we have <*>, = tt/2, yfr x = W 2 ’ 
so that the critical rays fall along the positive axis of reals, i. e., do not exist 
since this line is excluded throughout. In this event the elements of G(x) 
preserve their asymptotic form throughout the entire plane, a result which 
agrees with that obtained in § 1. The second case is that in which the argu¬ 
ments of p,, • • •, P, are equal. In this case the quantities e tJ are all real. In 
the first quadrant the only critical ray is the positive axis of imaginaries. In 
the fourth quadrant one finds critical rays different from the negative axis of 
imaginaries. How shall we reconcile these unsymmetric and apparently contra¬ 
dictory results, since by Theorem III the positive axis of imaginaries is not a 
critical ray ? If one of the constants iff] which determine the change in asymptotic 
form vanishes , the formulas are still valid but do not give the dominant term.* 
The true critical rays in the first quadrant are symmetrically placed with respect 
to those in the fourth quadrant. 

In general the effect of a zero value of c'J (or of c-" 1 ) is to shift the 
actual change in asymptotic form to a secondary critical ray making a smaller 
angle with the positive axis of reals. The complete determination of its posi¬ 
tion may be effected when the constants c.; are known. 

The critical rays for these solutions have been determined by Galbrun by 

-Just as. for example, if « = *' we have .«— e^ (0) along the positive axis of reals, but this 
does not give the dominant term. 
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use of the Laplace transformation. It may be noted that the above results hold 
when the functions a u (x) are only restricted as in (2), since only the form of the 
functions /> y (x) at z = 0 and z = oo has been made use of. An interesting dif¬ 
ference between the case when the functions « 0 ( x) are rational and the case 
when they are not lies in the fact that there are a limited number of possible 
positions for the secondary rays in the first case, and an unlimited number in 
the second. 

§7. The Fundamental Problem. 

The properties which we have thus far adduced completely characterise the 
solutions of a linear difference system (1) with rational coefficients, as is evi¬ 
denced by the following theorem : 

Theorem VI. Let G(x) and //(x) be two matrices of single-valued func¬ 
tions, analytic save for poles in the finite plane and such that their elements 
have the property 

lim g lJ (x)x-*'p-'x-''= 3 U and lim h g {x)x-**pj'x-'' - V (l s ol + °) 

Mmm 

for x in any left or right half plane respectively , and furthermore let 

G(x)-H(x)P(x) 

where the elements of P(x) are periodic of period 1. Then £(x) and //(x) 
are the first and second principal matrix solutions of a system (4) in which 
the elements of A (x) are rational in x, at least if />,, p i% •••»/>„ arc distinct. 
Proof. If we put 

yi (x) =s G(x+ 1 )(?“'(x) = //(x+1) //"* (x), 

as we may by the hypothesis of the theorem, it is clear that the elements n 0 (x) 
of A (x) are single-valued and analytic in the finite plane except for poles. Fur¬ 
thermore, by direct computation, the elements ^' y (x) and /< y (x) of G~ l (x) and 
//“‘(x) are seen to be such that 

lim ( ! ) ( PiY— a iJ and 15m h ‘J (■*) ■***'( Pi Y‘ xU = » 

/3* 

in which <r tJ denotes as before the element in the ith row and Jth column of 
the matrix inverse to (#„). The variable x lies in any left and right half 
plane respectively. Therefore we have 

M x ) = £.9ir( x + 1 )ffr j(*) — + l)KA*)* 

»=| T=| 

fr | 

Tran*. Am. Math. Soc. 10 
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where < (r and t„ tend to zero for all large values of x , that is, in overlapping 
right and left half planes. This may be written 

1 + x) 

where < , tends to zero as * becomes infinite. 

Thus a w (») must have a pole of order y. at a; = oo with leading coefficient 

■ 

T = 1 

and is a rational function of x. In order to find the roots of the characteristic 
equation (3) in this case, we make an initial linear transformation with constant 
coefficients on ?,(*), • • •. 9A*) in tbe linear difference system 

n 

<*(*) = £ «.>(*)?;(*) 

1-1 

which has been constructed. The constants undergo the same transformation 
and may be reduced to ; thus we shall have Uj — so that the invariant 
characteristic equation is 

(p-pi)(p-p*) ••• (P“/0 “ 0 

with distinct roots p,, p. not zero. Hence the above system satisfies the 
restrictions of §1. According to the definition of principal matrix solutions, 
G(x) and H(x) must be the first and second principal matrix solutions, since 
the principal matrix solutions are the only ones which preserve a definite asymp¬ 
totic form a 0 (x) even to terms of the first order in any left or right half plane 
respectively. 

It is evident that the n* + 2n exponential constants p t . r,, s,j together with 
the functions ;> w (x) characterize the principal solutions more or less completely. 
Let us turn again to the case in which the coefficients o 0 (x) are polynomials of 
degree y in x. In this case the equations (51) show that the functions p u (x) 
involves n 2 y- n further polynomial constants <*$. But among the exponential 
constants there are n which are not essential since we can alter any set s u% • • •, s til 
by an arbitrary multiplier and not affect the associated linear difference system 
(1). Hence there have been specified in all 

(n* + 2>i) +(»V— n) — n or »r(/*+ 1) 

characteristic constants counting only the ratios .« u , ■%*, • • •, as character¬ 
istic constants. This number is precisely the number of arbitrary coefficients 
in the n 2 polynomials a tJ (x), each of degree y or less in .r. Now the system (1) 



LINEAR DIFFERENCE EQUATIONS 


281 


1911] 


determines the constants, but not uniquely since r,, — , r n may be replaced by 
any determination r, + 2m, V — i , •••»**«+ 2m a V — 1 (m,, • • • * integers) 
in the formal expansions. There is however exactly one set of characteristic 
constants for each such determination. If the characteristic constants are inde¬ 
pendent, one may conversely regard the coefficients in the polynomials a 4J (x) 
as functions of the characteristic constants. The following definition and theorem 
go to show that conversely the characteristic constants do describe all the essen- 
ial properties of the solutions : 

Definition. Two linear systems (1) are said to belong to the same class if 
one may be transformed into the other by a linear transformation with rational 
coefficients. 

Theorem VII. Two linear difference systems (1) in which p l% p r • • •, p„ 
are distinct and in which the functions a %J {x) are polynomials in x of degree 
ft will belong to the same class if their characteristic constants are the same. 

Proof. Let 

(54) (?,(*+ l).il l (9B)(7 l (x) and G t (x + 1) - A t (x)G 2 (x) 

be two systems of this kind, expressed in matrix notation, which possess the 
same characteristic constants. For convenience we suppose both of the equa¬ 
tions transformed into a normal form in which 


(55) a tJ (x) = Pj{S ii x*+*: j 'x-'+ •••)• 

This may always be effected by one and the same linear transformations with 
constant coefficients, 

G(x) = C G(x), 

namely by that transformation which reduces the constants to S tJ , 

This is always possible by (7), and if one substitutes in the new formal expan¬ 
sions, it appears at once that the transformed elements of A t (x) and ^l 2 (.r) 
have the form (55). The exponential constants p Jy r } and the polynomial con¬ 
stants are of course unaltered by this transformation, and we have explicitly in 
each case 

(56) (i = l, 2,- .•,»). 

This simplification being effected, let us consider the relation between the first 
principal matrix solutions (?,(*) and G : (x). Write 

(57) G t (T)=Ii(x)G t (x). 

We find then 

(58) /l(x) = G l (x)G;'(x)=I/ t (x)r(x)[/f 2 {x)P(x)]- l =fr i {x)II:\x), 
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where H (x) and HAx) are the second principal matrix solutions and P(x) is 
the common matrix of fundamental periodic functions. It is now easy to dem¬ 
onstrate that R(x) is a matrix of rational functions of x. The elements «„(*) 
:r!us matl hare only poles in the finite plane and the relation (58) shows 


that 

(59) 


lim R(x) = S tJ 


in the complete vicing 'f*=« on account of the common asymptotic form 
of G and G„ //, and H.. In consequence of (59) the elements r„(x) must be 
rational functions of x, analytic at x=oo and taking on the value « at that point. 
Substitute the value of <?,(*) given in (57) in the first of the given difference 

systems. There results 

G,(x +1) « [fl-’(x+ 1)/I,<?:(*)■ 


A comparison with the second of the given systems shows then that 
(60) A t (x)R(x) = R(x+l)A,(x). 

This is the condition that the two systems belong to the same class and are 

transformable, the first into the second, by (57). 

Even if this theorem is true there remains the possibility that not all of tile char- 
acteristic constants are independent, although we are now certain that these 
constants effectively determine the transcendental properties of the solutions. 
We may however say something more of the dependence of solutions on the 
characteristic constants. 

In the first place we observe that the exponential constants are expressed in 
terms of the coefficients in (56) when the reduction to normal form has been 
effected. Now the principal matrix solutions of (1) were constructed by means 
of certain limit determinants and a process of summation based on a contour 
integral. All the processes employed are uniformly convergent in the coeffi¬ 
cients <ff] and involve them analytically. It follows that the two principal solu¬ 
tions G(x) and //(*) of (1) are analytic functions of the constants a*,? as 
well as of x, and hence the elements of 


P(x)-G(z)H-\x) 

have the same property. This necessitates, of course, that the coefficients d'J be 
analytic in the coefficients a*!). Let us take p,, • • •, p„, r,, • • •, r n as fixed, 
which amounts to choosing in (56) once for all. The coefficients d/] are then 
single-valued analytic functions of the remaining coefficients u\'J, which appear 
jn precisely equal numbers. 

Now only two things are possible: either the characteristic constants are inde¬ 
pendent and one may conversely regard the quantities as analytic functions 
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of the quantities cor else they are dependent. In this latter case we should 
be able to choose 


=<f> tJk (\) for all », j, k % 


where is an analytic function of the parameter X, in such a way that the 

characteristic constants are independent of X. The functions a^(x) are then 
dependent on X, so that we may write A (x) = A(x, X). Furthermore we can 
take generally 

A,(x) = A(x, X # ), A t (x) = A(x, X) 


as two determinations of A (x) which give equations (54) with the same charac¬ 
teristic constants, and we may take X = X 0 as a value of X for which the func¬ 
tions 4> iJk (x) are analytic. The matrix R(x) is defined by (57), where £,(x) and 
Cfj(x) denote again the first principal solutions in (54). The elements r^(x) 
are rational functions of x by the preceding theorem. 

We will now show that the functions r, y (x), which depend also on X, must be 
polynomials in x. To this end we make the further assumption that no two of the 
/*" roots of the determinant equation | A (x, X°)| = 0 differ by integers. This 
is manifestly the general case. 

If one of the elements of li has a pole at a point, it will be possible to form 
a group of such points 

a, a -f 1, •• •, & 


containing the special point and such that a — 1 and 0 -f 1 are poles of no ele¬ 
ment of li. Now if we consider (60) at x = £, we see that the right-hand 
member is analytic at this point, and that the left-hand member will have at 
least one element with a pole at x = & unless | -4,(x)| ~ 0 at x = 0. Hence 
we must have J,(^) = 0. Likewise if we consider the same equation at 
x = a — 1, we see that the left-hand member is analytic at this point, and that 
the right-hand member will have at least one element with a pole unless 
| A t (x)\ = 0 at x = a — 1. Hence we must have — 1) = 0. But for 
X near to X 0 , the roots of 

M.(*)l-M(*> x .)l = ° an<1 x )l = ° 

will nearly coincide and no two of them, as a — 1 and respectively, can be 
congruent and distinct on account of the hypothesis concerning the roots of 
| A (x y X 0 )| = 0. It follows that the elements of 7?(x) can have no poles in 
the finite plane, and hence are all polynomials in x. According to (59) these 
polynomials reduce to at x= oc, and hence are identically equal to 8, 7 . 
The elements of A (x, X) will not depend on the parameter X, contrary to 
hypothesis. 
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Hence the characteristic constants are independent, and the coefficient).inthe 
nolvnomials a,Ax) may be conversely regarded as analyt.c functions of these 
characteristic constants. A fundamental problem of the linear difference 
system is to study this functional relation and in particular to determine 
whether there exist, a system <1) with coefficients polynomial in x oj degree 
not greater than „ having any assigned set of characteristic constants. 
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FORMAL THEORY OF IRREGULAR LINEAR DIFFERENCE 

EQUATIONS. 

By 

GEORGE D. BIRKHOFF 
of Camiiridoe, Maks.. U. 8. A. 


Introduction. 

The general theory of ordinary linear differential equations with rational 
coefficients must be regarded now as known in ita broad outlines: there exist 
in all cases whatsoever a full quota of formal series (normal and anormal) for 
each singular point; on the basis of these series, the behavior of the solutions 
in the neighbourhood of the singular points can be adequately characterized, their 
nature in the large can be determined by means of the monodromic group, and 
finally the inverse Rieraann problem can be formulated and solved by direct use 
of the Fredholm theory of integral equations or otherwise. 1 The details have 
not been carried through except when the series which enter are of normal type, 
but the corresponding formulation and attack in the most general case are 
sufficiently evident. 

On the other hand the situation is much less satisfactory for ordinary 
linear difference equations with rational coefficients. If these equations be 
written as a linear system, 

N 

(0 ydx+1) = 2i°iA x )yA x ) (*- >• 2 * • • ■")» 

in which the w* rational functions flyj(x) are analytic at x = » or have a pole 

1 Cf. my paper, The Generalized Riemann Problem for Linear Differential Equations and 
the Allied Problems for Linear Difference and q Difference Equations, Proc. Am. Acad. Arts and 
Sciences, vol. 49 (l9*3>- PP 5 21 — 
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of maximum order „ there, then it is only the regular case in which the charac 
teristic equation in Q, 


( 2 ) 




o, iV>; afi — lim 


has » distinct roots «... fc,:.. *.. not aero, that has been treated adequately. 
To be sure, Norland, in his fundamental work on linear difference equations, 
has given a powerful general method by which solutions analytic in certain 
limited regions of the complex plane can be constructed always', but this method 
affords little indication as to the nature of those simplest analytic solutions, 
devoid of artificial singularities, which are of central theoretic importance. 

The most general examination hitherto made of linear difference equations 
not of this special type is due to Adams ’ He discusses the extent to which 
my own method of approach to the regular case’ admits of extension to the 
irregular case, and finds that while similar results can be obtained in certain 
more general cases, yet the method appears to break down. In brief, he finds 
that so long as there are n types of formal series with elements of the form 


x'“g'xf(a + ^ + ) 0 **£/*)- 


the same method is applicable. Since these formal series are of the same kind 
as appear in the regular case, it is natural to group this particular irregular 
case closely with the regular case. In more general cases Adams only establishes 
highly restricted results. The difficulties which he meets with are of three types: 
(i) he does not arrive at a full quota of formal series in all cases'; (2) the 
sequences defining the determinant limits' converge, if at all, in much less exten¬ 
sive regions of the x plane than in the regular case; and (3) in consequence, the 

1 Cf. N. E. Nbrlund, Differenzenrechnung, Berlin. 1924. chap. 10. 

* C. R. Adams. On the Irregular Cases of Linear Ordinary Difference Equations, Trans. Am. 
Math. 80c., vol. 30 (1928'. pp. 507-541. In this pap” references to the work of Barnes. Horn. 

Batchclder, Perron, and Galbrun may be found. 

1 General Theory of Linear Difference Equations, Trans. Am. Math. Soc.. vol. 12 (I9«' - 

pp. 243—284. 

4 Not even in the case n = 2, in which many but not all cases have been treated by 
Batcbeldcr. Batchelder has not published these results. 
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determination of intermediate solutions' and principal solutions', as in my paper 
by means of series and contour integrals, fails. 

It is the aim of the present paper to show that, when certain further types 
of formal series involving logarithms are taken account of, there will always be 
a full quota of n such formal series solutions. By way of application of this 
basic result, certain interesting formal questions are also treated. 

In a subsequent paper devoted to the analytic theory of irregular difference 
equations, also to appear in these pages, I expect to present the extension of 
my earlier theory to the truly general case. Such an extension would of course 
be impossible without the result of the present paper. It involves appropriate 
modifications in the method of contour integration as well as other changes of 
consequence. 


§ i. The Linear Difference System and the Single Equation. 

Without essential restriction it may be assumed that the determinant 
I | in (i) is not identically zero. In fact in the contrary case there is 
obviously at least one identical linear homogeneous relation between y,(x + 1), ... 
y„(x-fi) with coefficients which are explicitly given in terms of the minors of 
this determinant. If we replace x by x—i in this relation, we obtain a like 
relation between y,(x), ... y*(x). On solving for one of the dependent variables, 
say y n (x), in this relation, and elimination of y.(x) in the first n— I equations (i). 
we obtain a like equivalent system of order n — i. Proceeding successively in 
this way we arrive finally at an equivalent linear system of order < u, for which 
the determinant in question does not vanish identically. 

Now for such a system (i) let us write 

N 

( 3 ) yW = + ••• + M*)y-(*) ■ 2j***(*)y«(*) 

«■! 

where A,(x),... A*(x) are n functions, rational in x but otherwise arbitrary. By 
use of the equations (1) we obtain successively 


y(x+ 1 ) = 2 + MwM. 



y(x+11) = 2 + n)<iap(x+n— l) • • • . (x)y,(x). 

a..*,...*- I 
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On the right-hand aides of (3). (3') *• ■ + « Unear homogeneous eipreasiona 

in </,(*). ■ • • ».(*)• Hence from the8e e< t uation9 we obtain at lea8t °" e lmear 
homogeneous equation of the fora 

( 4 ) £(y) = a,(*)»(r +•») + a,(x)jr(x+n—1)+ + <•.(*) y(x) = o, 

in which not all of the coefficients <n(x) vanish identically, and furthermore these 
coefficients are explicitly expressible in terms of the functions X,, a,,. 

It is possible to choose X„ ... X. so that neither o„ nor a. vanishes identic¬ 
ally, i-e. so that the linear difference equation (4) is actually of the nth order. 
To establish this fact we consider the determinant of the linear homogeneous 
expressions in »,(*). ,,.*.(*) which appear in the first n of the n-M equations 
(3), (3'). The elements of the first row are *,(*), • • • *•(*) which olearl y be 
taken at pleasure at an arbitrary point x. The elements of the second row are 

n • 

2 u*+!)*-«(*). ••• 2 u*+o«mW 

—I — 1 

respectively. But *,(x-t •).-•- U*+ >) may clearly be assigned values at pleasure 
without affecting the values of A,(x), • • • *•(*) already selected. Inasmuch as 
| a ,j(x) | is not identically zero, it is therefore clear that the elements of this 
second row may be independently selected at will. For the third row a similar 
consideration leads to the conclusion that these elements too may be indepen¬ 
dently assigned at the given value of x, since the determinant which enters is 


12 a ‘A x + 0««M I * I *«(* + 1 ) 11 a *i WI * 0 ■ 

fi-t 

By proceeding in this manner it becomes apparent that the determinant of the 
coefficients of y„ ... y. in the first equations (3), (3') can be made not to vanish 
identically, since all of its elements can be taken arbitrarily at any point x 0 
such that |a<j(x)| is defined and does not vanish for x = x 0 , x 0 + 1, ... x 0 + n — 2. 
Likewise the similar determinant for the last n of these equations can be made 
not to vanish. Hence the equation (3), obtained by equating the determinant 
of the augmented system (3), (3') to zero will have the desired property 
</ 0 (.r)^o, a„(x)^o. 
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Therefore it appears that to a fundamental set of solutions of the system 
(i) with |a/j(x)|^o there corresponds a fundamental set of solutions of some 
single equation (4) of the nth order, and vice versa. 

If then we can establish that the single linear difference equation (4) pos¬ 
sesses a set of n formal series solutions in the usual sense 1 , it is evident that 
the general system (1) will also possess a set of n formal solutions, obtainable 
from the formal solutions of (4) by means of the equation (3). 

These obvious considerations justify us in focussing attention upon the 
formal series solutions of a single ordinary linear difference equation of the 
nth order of the type (4) with a 0 [x), a»{x) not identically zero. 


§ 2. Statement ot the Formal Problem. 

In order to deal conveniently with the questions which arise concerning 

the formal series solutions, it is desirable to broaden somewhat the initial 

formulation. Let us demand merely that the coefficients a t (x ) of the equation 

(4) under consideration are to be formal (i.e. convergent or divergent) power 

1 

series in descending integral powers of x, or, more generally, of x v (p, a positive 
integer). Here only a finite number of positive integral powers are permitted 
to enter. Of course the corresponding assumption for the linear difference system 
(1) is that the coefficients are similar formal power series, and the same integer 
p^i is evidently involved in associated equations (1) and (4), even if X t (x) t ... 

A„(x) in (3) are also such series in x* . It is obvious then that for the complete 
specification we must not only g:ive the equation (4), but also the value of the 
basic integer' p which is to be adopted; all possible values of p are evidently 
positive integral multiples of a least value p 0 ^i. 

The usual method* for the determination of formal series begins with the 

substitution of a series 

' We regard a set of n formal solutions as distinct (i.e. linearly independent) in case there 
is no identical linear homogeneous relation between them in which the coefficients are either 

constants or of the more general form (/. on integer). 

* Generalized here to the extent that we allow p to exceed I. 

27—29643. Ada malhtmalica. 64. Iraprim* le 12 avril 1930. 
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*(*) = /' e PII 'x' (“ + A + —) 

X 9 

(p(x) = yx + lx~ + ■■+ 1* p ) 


in the equation and the attempt to determine p.y, d. a.b,... by the 

method of undetermined coefficient*. Here the formal identities: 


*(.ee a til « «-'/ti tip? \ 

(x + 0 * =*-x'(. + -) 

px> 

(x+.r-x'( ■ +j+"): 

*1 

for i- enable us to remove a factor x' e A "’x' from the given equa¬ 

tion after this substitution ,v = «(x) has been made. When this factor is removed, 
the n + > leading terms in the n+l series which appear on the left are precisely 

ft n p !■— I) —ii*" 

aoj,e p x 9 , a 0 ) t e 9 x 9 t ...a m j m x 9 , 

-U 

where we have written a, u for the coefficient of the leading term a IJt x 9 in 
a t (x) for t-» 0,1 Obviously neither j 0 nor > can be infinite since neither 

a 0 (x) nor «*(*) is identically o. 

Now if such a formal identity is to be possible, there must be two leading 
terras of the same degree in x , so that 


Cf fif/t 1 




for some l and m [ly*m), while all other terms are not of higher degree, i.e. 

—jt + /I(w—«)£ —jm + /i( n—ro ) (l=0, 

whence 

jm^— #*(*’”"*)• (*=0, 
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But these conditions admit of a very simple geometric interpretation. 1 Let 
(i,j) represent the cartesian coordinates of a point in the plane for which the i 
axis is directed horizontally to the right and the j axis is directed upwards. 
Mark the n+ i points ( i,jt) where * = o, (see 6gure i). Evidently the 

conceivable values of y are given by the negatives of the slopes of all possible 
lines through two of these points, while the inequality imposed will only be 
satisfied if all the remaining points lie above or on such a line whose equation is 

O'-i-)-— *(*—«•)• 



Fig. I. 


This leads us to a unique broken line L, concave upwards, whose vertices fall 
at certain of these points, while all of the other points lie above or on this 
line L. Furthermore the points (i,j) (i,j, being integers) which fall below this 
line can correspond to no terms in the n+ i series under consideration. 

When, however, y has one of these values, and the coefficient of the 
highest power of x on the left is equated to zero, we obtain the characteristic 
equation’ in p = e r , 



in which the first and last terms in parentheses correspond to the extreme 
vertices (i,ji), ( i'.ji•) of the line segment of slope y in the above diagram, and 

1 Cf., for instance, N. E. Norlund, Differenzenrecbnnng, pp. 312—313. 
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ln which the intermediate terms correspond to the intermediate marked pomU 
on that line segment. Clearly in this manner are obtained as many equations 
as there are values of p. If these values of p are denoted by p„ f*.. • • •#** 
with p,>p,> ■ >p,. the total number of non-zero roots (counted according to 

multiplicity in the separate equations) is precisely «; for if o. 
be the values of . for the successive vertices, these roots are in number precisely 
« -» , • • ii-ik-i respectively. Thus there are in general k characteristic 

equations, father than a single characteristic equation as in the regular case. 
It is the essential advantage of the single equation (4) over the system (1), as 
basis from the formal point of view, that all of the possible values of p are 

immediately obtained for an equation (4). 

If now we proceed further in the attempt to obtain a formal series solu¬ 
tion by comparison of the terms of successively lower degrees in **> we are 
immediately led to the following results (Cf. Adams, loc. cit., § 1). 

If the values p,,...p* are all integral, and none of the characteristic 
equations have a multiple non zero root, the first comparison determines y as 
indicated above, the remaining comparisons give d,.... a, 6 , ... in succession, 
as specific functions of the previously determined constanta, and with b, c, ... 
in particular involving a as a multiplicative factor. Hence in this case there is 
obtained a full quote of formal series solutions of the type ( 6 ) under considera¬ 
tion, and these are evidently determined up to a factor 


Q& kmY ~ 


l* 


[k. an integer) 


corresponding to the arbitrary multiplicative constant a, and the ambiguity in 
the determination of y. Inasmuch as the detailed proof is entirely straight¬ 
forward and of familiar type, and is not necessary for our later purposes, it is 
omitted here. 

More generally, if some of the values p,,...p* are fractional, but no 
multiple non zero roots occur in the characteristic equations, let such a fraction 
be mil in lowest terms. There exist then corresponding series of the following 
anormal type 

(6') S{x) = xVe'l*V (a + + ) 


(«*) 


1P-1 

— yx + dx 


+ 
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Such a formal series solution has evidently l formally distinct determinations in 
all cases, since if x makes p positive circuits of x = o in the complex x plane, 

the factor x p is multiplied by a factor e*** and y is thus augmented by 
2npV — l, where p = m/l is not an integer. Here again there are always 
precisely n formal series, when these various determinations are taken into ac¬ 
count. But inasmuch as the method is perfectly straightforward and of familiar 
type, and moreover not necessary for our later purposes, it is omitted here. 

As Adams has noted (loc. cit., § i) the general' case in which some of 
the roots q of the single characteristic equations are of l fold multiplicity while 
p is an integer also leads to corresponding anormal series of type (6), so that 
again a full quota of series is obtained unless a certain secondary characteristic 
equation holds. Unfortunately, the method of direct comparison may lead to 
indefinite algebraic complications if this secondary equation is satisfied. 

There is, however, a third type of formal solution which may arise, but 
whose importance seems to have been largely overlooked. 1 Let *(x) be any 
formal series of the type (6) or (6'), and let f(x) be a second such formal series 
with the same coefficient preceding the power series as s(x), save that the 
constant r may be modified by an integral multiple of I Up. Then, for instance, 
there may exist two formal series solutions of the form 

(6") #(*),#(*) log * + l(x) 

in the case I, or 2 1 such series in the general case l^i. More generally, 
if s(x), t(x), ... ic(x) are k series of the form (6). (6'), all with the same /t, P(x), 
and if the constants r which enter differ at most by a multiple of I Up, then 
there may exist kl solutions 

(6") *(x), Mxjlogx + t(x), ... *(x)( logx)* + <(x)(logx)*-> + ••• + w{x). 

It will be observed that when log x is changed to any one of its other determina 
tions, each formal solution is augmented by a linear combination of the preced¬ 
ing ones affected with constant multipliers. We shall regard such series (6 ) as 
of normal type if / = I and as of anormal type if l> \. 

' See. however, N. E. Norlund. Differenzenrechnung. chap. II, § I, whore a specialized case 
( 6 ") of this logarithmic type is considered for those linear difference equations of Kuchsian type , 
in which the series <i,(x)/a 0 (x) begin with a term of not higher than degree —« in x. 
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in Part I of this paper to solve what may be termed 

te to attack the problem by a method based upon the notion of reducbility. 
Zh ielta successive reduction of these difficulties by a well-defined senes 
0 f steps, each involving only the solution of linear algebraic equates. 


§ 3. Solution of the Converse Problem. 

It is an easy matter to solve the converse problem by demonstrating that 
to every such set of « (linearly independent) formal series there corresponds a 
uniquely determined linear difference equation of the n th order 

Suppose, for instance, that *,(*),... ».(*) are .. series of the simplest type 
(6). The corresponding difference equation is then essentially given by 

y(x+n) y(x + n—1) .. y(z) 

*,(:r + n) »,(i+n—1)... *i(*) 

s.(x+a) »»(*+»—!)•"•• *•(*) 

In this case we have only to divide the (#+ !>■«. row ») the ex- 

ponential factor *>#**• in order to obtain the equation desired Moreover 
even if certain groups of the n series are of the anormal form (6) *«“*>« 
conclusion is possible, although the coefficients «,(*), «,(*).... «.(*) obta.ne 
are given in the first place as power series in descending powers of some root 

of *i. However, it is obvious that the equation written is in reality not altered 
if the various determinations of these series be permuted, so that actually the 

coefficients *(*). «.,(*),... «.(*) do not involve fractional powers of x» after a 
suitable factor is removed. To indicate briefly the situation in the case when 
logarithmic terms enter as in (6"), we consider the simplest possible case, namely 
the case n=2, J> = I, l = * > n there are a pair of solutions, 

s(x), $(x) log X + t(x ), 
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in which s(x), t{x) are of the type (6) with the same /x, y, r. The corresponding 
equation is then 


y(x+2) 

*(x+2) 

s(x+ 2) log (* + 2) + f(x + 2) 


y(x+i) 
s(x+ 1) 

s[x + 1) log (x + 1) + f(x+ 1) 


fW 

tf(x) 

*(x)logx + f(x) 


If we multiply the second row by logx, subtract from the third, and make use 
of the formal identity 


log (x+t) = logX + 




we may eliminate the logarithms and thus obtain the desired equation of type (4). 
Obviously a similar manipulation leads to the same result in the most general 
logarithmic case. 

The solution of the converse formal problem thus obtained in all cases 
is evidently unique, since if we write out equation (4) with y(x) replaced by 
*i(x), ... s n [x) respectively, we obtain n linear homogeneous equations in the 
n + 1 coefficients a 0 [x) t a,(x), ... On(x), which determine them up to a multiplicative 
series factor, just because the formal determinant of the n th order |#«(s+y- H l)| 
is not identically zero. 


3. Solution of the Formal Problem for n— 1. 

We will begin with a proof of the following fact: 

Every equation (4) of the first order (n= 1) has a series solution of the form (6). 
Such an equation may be written in the form 

y(*+l) = *'( fe S + ^+ jy(x) 

X* 

when n is An integer and p is a constant not zero, inasmuch as we may divide 
the equation through by a 0 (x) and transpose the term in y(x). 

If we change the dependent variable by substituting 

yW = xVp'y(i) 
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and- divide through * the -- of *(* + 0. - ^Hed elation takes a 
similar form, simplified to tbe extent that , » — and ? ,s , 

b. 


#(*+,)-=»(*)»(*) (»w-« + V ) 


Now introduce the dependent variable r = lo gy , and we obtain at once 
(formally) & jb.-b'. 

r(x+ i)-*w='os fc <*>=i + -hr 1 * 

x 9 2X p 

-V V- 

X* x’ 

If in this last equation we wnte 

, M + -+•-** + '» logI + fllT + 
it b immediately found that r_„.. • - *. are uniquely determined with 


Z—p +1 


,...*-1= C ’Z'_- • *o ~ «!>• 


'-i ' p p 


-j 

These results emerge by direct comparison of the terms in * * , x on 

both sides. Now the comparison of term, in *T (*>,) leads to equations of 
the form 


-k + 


■ft_, + 9>. = «* (*— p+1 . p+ 2 ,...), 


in which ?, is a known polynomial in the coefficients * which precede ft-,- 
Thus s„ r„ ... are determined in succession and uniquely. 

Evidently the formal series for *(*) so obtained leads to a formal senes of 
type (6) for y, so that the proof of the italicised statement is completed. 

1 In the case p- l, this is to be written as 

*(*)-*. log*+ *!*-» + ••• 
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§ 4. Simplification of the Formal Problem. 

We propose to simplify the formal problem by establishing the following 

fact: 

There will necessarily exist always a complete set of foi~mal series solutions of 
types (6), (6'), (6”) if only every equation (4) admits at least one formal series solu¬ 
tion of the non-logarithmic types (6), (6'). 

The truth of this statement may be argued as follows: 

Suppose if possible that the statement is false. In this case, even though 
such a formal series solution (6), (6') always exists, yet there are equations (4) 
for which a complete set of series solutions does not exist. There will then be 
a least value of n for which a complete set does not exist, and, according to 
the result of § 3, we must have n > 1. 

Let s(x) be one of the formal series solutions (6), (6') of such an equation 
(4) of least order n> 1 for which the theorem fails, and write y = s{x)y. After 
division through by a suitable factor (for instance, s(x + n)), we obtain an equa¬ 
tion of the form (4) in y , and of order w, in which, however, the basic integer 
p is perhaps replaced by some integral multiple Ip. But this new equation in y 
admits of the obvious formal solution y= 1, so that if we write the equation 
in terms of y, Jy t J*y t the term in y disappears; in other words we have to 
deal with an equation (4) of order n — k in J k y where J k y is the lowest order 
difference to appear explicitly. But this is an equation of the form (4) in J k y 
of order n — k<n, and hence our hypothesis ensures that it admits a complete 
set of n — k formal series solutions $(x). 

Thus we are led to consider the formal difference equation 

J k y = s(*V 

It is clear that if we can show that this equation admits n—k formal solutions 
corresponding to the n — k known series 5 (x), we are led to a contradiction and 
the italicized statement must be true. In fact the equation in y is satisfied by 
the n — k series so obtained and in addition by the k distinct series forms, 
1, x, ... x* -1 for which J k y is o, so that the n series exist and are obviously 
linearly independent. 

Consequently it is clear that the italicized statement will hold if the follow¬ 
ing lemma can be established: 

28 — 29(143. A (la malhtmalica. 84. Imprim8 le 14 avril 1930. 
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V .(.) <■ * /•— -- •/ '«* » « n - *• 

Jy = *(x) 

r sr,:= 

if S[x) is Of the form (6) or (6'). We have then to consider an equat.on 


Jy = X> + ) (S “^ 0) ' 


wh ere „ is an integer, since we may replace p by the integral multiple Ip in (6'), 
In the case fi<o we substitute 


•ti 


y(x) = x p 


and, upon division through by the coefficient of y(x) in this equation, obtain 

p.U*»i«(i+ + i)] jl x+ *)-$(*) “*o + -1 + 

where the factor in brackets on the left has the form 

*’(«’*' + 4+A + ) 

' X* X* 


If now we substitute in this modified equation for y, 

y(x) = y 0 + ^1 + 

x** 


it is obvious that are determined in succession by 

with y 0 = — s 0 for instance. More precisely, the terms in x 
tion of the form 

(ft — ijk — St 


direct comparison, 
t 

p lead to an equa- 
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where rpk is a polynomial in y 0 , y lt ... yt— 1. Thus y,, y t , ... are determined in 
succession by taking k= I, 2, —, and the desired series solution y(x) is found. 
When n>o, we substitute 


tf r _e 

y(x) = iP *y{x) 


but divide through by the same factor as before. We obtain 

[V *• ,og ( l+ $ P Jy( z+ ,)_ x Ty( x ) = , 0 +il + ... (^>o). 


If now we replace y by the same series as before, and note that the term in 

i 

brackets is a power series in negative powers of x p , starting off with a constant 
term not zero, we see at once that $ 0 , &.••• are uniquely determined, with 

_T_C 

y 0 — s 0 e v for instance, and again the desired solution is obtained. 

There remains the possibility that s[x) is of type (6) or (6') with p = o. 
The same substitution for y as in the first case leads to an equation 

r^n-M /, + iVl y(x + i) - y(i) = + • • •. 

X* 


where the term in brackets is of the form 

«»+ 4 +-,+ 

x p 

When we substitute the series for y as before, we discover at once that y 0 , y lt ... 
are in general uniquely determined in succession with y 0 , for instance, equal to 
Vfe 7 —l). This determination fails when t 1 is l, and then only ; in this case we 
may take y = o. 

But when y = o, inspection of the bracket shows that if the leading term 
* 

in P(x) is xi'' (fr<p). then the series in brackets begins as follows: 


i — - 

PX p 
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and the equation in y may be written 


Jy + 



(*<?)• 


Consequently by writing 


>(*) 


L x' 


c fi = , vl*k y„ ... are determined in succession and a solution is 

” “ S.JX- »i-« «■>“ ! * “ 

case the original equation in y becomes 


Jy^x* 


('• +i i + ) 


which can be at once satisfied if we write 

9 (x)-*r* 1 (». + -,+ ) 

' *» 

With u — |J(r+ l), determined, unless indeed r is a multiple of ./j> with 

and a term in *"■ actually appears on the right-hand side. In such a 

case we may eliminate this term by writing 

y(x) = y{x) + %4»)plogx, 

and proceed as before, except that the constant term in the series for V is 

"^Consequently in all cases whatsoever when «(*) is of type (6) or (6’), a solu¬ 
tion is obtained. It wiU be noted that this solution is of the same type unless 
it involves a Unear logarithmic term clogx. This last case of type (6 ) arises 
only when p-o, P(x) = o, and a term in appears in »(x). 

Thus the lemma wiU be proved if we can deal with the case when *(x) 
is of type ( 6 "). We shaU dispose of this case by showing how the solution of 
the equation Jy=s,(x). in which .,(x) is of the type (6") with logarithms which 
enter to the i-th degree, can always be reduced to that of a similar equation 
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in which l is reduced by unity and so finally to an equation of the type l = o 
already disposed of. 

To achieve this reduction, suppose that we have to consider 
Jy = s,(x) — *(x)(logxy + */-i(x) 
where .?„(./•) is of type (6), (6'). Write 

y(x) = .v(x)(logxy 4 //(*). 


where S(x) is a solution of 

J*{*) = *o(*). 

which we know to exist of course, according to what has been proved above. 
Upon substitution we find that y has to satisfy the corresponding equation 

Jy =» * 0 (x) (log x)‘ — fo(x) + *(x)) (log (x 4 - l))' 

+ -v(x)(log x)' + ^i-l(x). 

But the right-hand member may be written as a sum of three terms 

— *<>(*) ((log (x + i)/ - (log xV] — *(x) ((log (x 4 l))' — (log x)') 4 - */-i(x). 
Inasmuch as the identity 

(log (x 4 -1 )y - (log xy - (log x 4- j (log x)y - (log xy 

— l(\ogxY~ l J\OfrX 4 - • • + (^logxV 

obtains, where 

^lo K x-log(i + = + . 

it is clear that the first term in this sum, as well as the last, involves logx 
to a power not exceeding / — i. Moreover, on account of the presence of the 
factor J log x, the middle term will have the same property unless 5(x) involves 
logx. Hence it is only necessary to examine into this last possibility. 

But it has already been noted that the solution of an equation of the type 
Jy — i ? 0 (x), where s 0 (x) is of the form (6), (6'), can only involve logarithms linearly, 
and that this can only happen in the case /x = P(x) = o when a term in X“‘ 
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appears in Therefore the only case requiring further cons.derat.on .. 

in which the equation takes the form 

Jy = s 0 (x) {log xY + *-«(*) 

where 

' X* 


and where *„(*) contains a term in x~‘ 
from this single term, and that if we can 


It is clear that all the difficulty arises 
find a solution for the simple equation 


Jy = [J log *)(log*y. 


the difficulty is disposed of. But if we substitute 




we find 

jr, - uiog*)(iogxy - r J r? i(iog(x+ or* - (iogxy +, i 

= _ ^(logxy-'i^iogx)*- -j-^uiogxr 1 . 

Now on the right hand side there appears a sum involving logx to the (f-.)st 
power at most. Consequently in every case we can reduce the problem to 
of a similar type but with / decreased by at least one. as we des.red to prove. 

Thus, whatever be the series j(x) in the equation of the lemma, we are 
led to a corresponding solution of type (6), (6'). (6"). and the lemma is fully 
established. 


§ 6. The Formal Problem and Formal Reduclbllity. 

Let us term the equation (4), namely £(jr)= 0 , reducible' in case we may 
write symbolically 


L(y) = Mn^iiLAy)) 
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where M*-* and L* are difference expressions of the same type as L, of orders 
n — d and d respectively. 

Now if there exists no equation L(y) = o without a formal series solution 
(6), (6'), we have seen that a full quota of formal series necessarily exists (§ 5), 
and the formal problem is solved. In the contrary case there exists an equation 
L[y) = o of least order n> 1 for which no solution (6), (6') exists. 

Such an equation E must necessarily he itredticible for any admissible basic 
integer p. 

In fact if we could factor L symbolically as indicated above, then every 
solution of La(y) = o would also satisfy L(y) = o. But the equation Ld(y) = o is 
of lower order than L[y) = o, and so possesses at least one series solution (6), 
(6'), by hypothesis. Hence L{y) = o would possess the same solution, which would 
be absurd. 

Moreover, if ice effect any change of variables in E of the general form 

the new equation in y must obviously also be irreducible in the same sense, no matter 
how the integer p, the polynomial P(x), and the constant r be chosen . 1 

In consequence we proceed to consider several cases in which specific types 
of reducibility are established (§§ 7—10). It will then be easy to prove that 
such an equation E cannot exist, so that a complete set of formal solutions is 
thereby’ proved to be present always. 

§ 7. First Reducible Case. 

An equation L{y) = o with lc> 1 values of p, say p t , p t , ... pt, with ^, = 0 
and pi<o (» = 2 ,...£), is necessarily reducible with a symbolic factor of order d 
equal to the number d of non-zero roots q coiresponding to p t = O. 

In the first place we observe that by definition of pt, the hypothesis made 
ensures that the first segment of the broken line in the (*,j) diagram (Fig. 1) 
is horizontal while the later segments have positive slope. Hence, after the 
leading coefficient of L(y) = o is made equal to 1 by division through by a 0 (x), 
we may write (4) in the form 

1 Note that this change of variables leaves the equation of the same general form (4), although 
the basic integer p may be altered. 


536 




224 


171 M . M „.W in the equation so obtained can contain 

where the ser.es «,(*). «sl*»-''' a ’ W 

J while has a constant term not aero, corresponding 

n0 positive powers ., ll0riz0 „ tal segment. Furthermore it is apparent that 

“ the aeC °l)lT,>d are series of the same type but lacking constant tanns. 

thS Tptose - search for a symbolic factor of the form 

(8) Wr , y )- y (, + d) + P,(x)y(x + d-) + 

where »(x). • • ■** ^ 

tabling consta* ™ eJplic it notation Ux.y) is used instead of My), 

" £££. Lcasion to Change the x in the expression to x + „ Now 
it La is such a factor we must have 

(9) iW.Uxts-^ltMxlWxts^-.yH + A.Mx)Mx,y). 

. . ... , (,\ to be similar power series but lacking 

Here we shall ^ ^ j( 9uch an identity is to hold. 

constant terms . • ^ that „ the coefficient, of y(x+»).•• • »W 

,. J" £ ” ,Lt the constant terms agree on both sides, in virtue of the 

SiTl! made of the constant terms in „(x, and Let us cons.der 

-1 thpse coefficients. The comparison under consideration 

r:r s —i “ -—- - - - ■— 

equations between these coefficients: 
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tly) . ,(x + ») ♦ n,(x)y(x + n — .) + - + «.WfW = o. 


(io) 


a,(x) * p,(x + »“ rf ) + M*)» 

a,(x) - p,(x+ n-d) + A,(x)j»,(x + n-d-l) + A,(x), 

a„_i(x) « A,-d-i(x)pj(x+ i) + A^(x)ptf-ilx), 
a„(x) ■ A«-d(x);»d(x), 


where the law of formation is obvious. 

Since the constant terms *•.»• in A 4 ,p< have been spec.faed 

comparison of the terms in gives the following . equations: 


stated, the 
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a \\ = P\\ + ^-ii * 

°*i = Pti + *nPio + 

(II.) . 

flu-1.1 = 17><f—1.0, 

«nl = ^B-d.l P.10, 

_ J 

where we employ the notation /<j, to denote the coefficient of a: »’ in any 
series f{x). Hence the last written equation determines ?. H -d,i since }Uq? o\ 
the next to the last equation determines i, and so on, until the (e/+ i) th 

equation determines A,,. But then if we turn to the remaining first d equations 
we see that these in order determine p ll% p tlt ... pd\. Thus the coefficients of 

_ j 

x p in A,(x) and p t (x) are uniquely determined by this comparison. 

Next we may proceed to the determination of the second and higher order 

_ t 

coefficients A,* and />,* by comparison of the coefficients of x v for k=* 2, 3,.... 
Equations are obtained of a similar form 

<*\k=pik + A, k + j4u, 

<*u “ Ptk + luPi0 + Ajk, 

(n*) . 

a n-l.k = l»-d-l.kPd9 + An-d.kPd-t.O + An-l.k, 

a Ht = K-d.kPdO + Ank, 

where /!,* are known functions of the coefficients pij, Ifj [j<k ). 

Thus for k = 2, for instance, we see that A <a and pn are again uniquely 
determined, and clearly this process may be indefinitely continued, so that L(y) 
can be symbolically factored as stated, and the statement under consideration 
is proved. 


§ 8 . Second Reducible Case. 

An equation L[y) = o icith a tingle value of p = o, and so with a charactnistic 
equation in q of degree n with no zero roots (§ 2), is necessarily reducible unless the 
n values of ? are all equal. 

20 — 20643. Ada maOxtmalxca. 64. Imprim* to 14 «vril l»30. 
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The equation £(,)-o under consideration is of course still taken in the 
allowable form ( 7 ) above. In the case before us the coefficents «,(*) 

evidently power senes in negative powers of «?. of which at least the last, 
„ M contains a constant term a n o- 

If the n values of , are not equal, let p, denote some particular root o 
Uinlicitv d<n. We now attempt to find a symbolic factor £*(*.*) of order 
!<„ corresponding to this root, which shall be of the same fonn (8) as m the 
preceding Paraph, except that ^ “ 

power series in negative powers of x> with constant terms as follows: 


(. 2 ) 


PlO 


-dQ^Pto 


did- 1 ) 

1 .2 


Pd o 


(-0V 


Hence the single characteristic equation of !*(*.,) = 0 is necessarily k-tf-o. 
Furthermore we write L(y) once more in the form ( 9 ), except that the senes 

(.-I, 2,... n-dl, are power series in negative powers of x> whose con¬ 
stant terms we proceed to specify. 

In the first place we observe that the characteristic equation of the sj mbolic 
product is clearly 

[t-tW* + W * -1 + + X —“ °- 


Hence this product will have the same characteristic equation as Hy) = o « 
are properly determined, and this determination is obviously um- 

que, with 

PT* + A.otf-*-* + • • ■ + t* o, A^.o * o. 


At this stage then, and >.,» are uniquely determined, and the constant terms 
in the coefficients on both sides of (9) agree. The equations (10) simply state 
the equality of corresponding coefficients of course. 

1 

We have next to consider the terms in x ^ on both sides of (10). The 
equations obtained differ somewhat in form from (n.) inasmuch as the constants 
A 10 need not be o in the case before us. These equations are in fact 
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But the equations (13.) are the same equations as result from the algebraic 
identity in w, 

( 13 ,') u ’'~ i ~ Ad[u)A*-d-\(u) + Bn-d(u)IId-i(u), 

where Ad and Bn-d are the known polynomials 

Ad(n) ■" (m— QiY m K 4 + Pio' ,d -' + • • • + P**' 

Bm-d(u) - K"" 4 + + •” + A—-.0. 


and yf„_a_,, /7a—1 are to be determined from the equations 

.rf-i-iM - A,,*- 4 - + + • + A~U. 

fld-i{u) -Pn** 4 -' + P*. 


in which the coefficients are arbitrary. 

This is readily seen if we write the j-th equation (13,) in the form 

aji = 2 (Aio/ty-i. 1 + An/»>-*.•) (j — I,... »), 

f 

where we take 4>= l. J,, = 0, I. A, =0, and all the subscripts are to be 
positive or zero of course. 

But the polynomials Ad, B n -*i are relatively prime, while A n -d- i and J7 rf _, 
are entirely arbitrary polynomials of degrees n — d— I and d— I at most respec¬ 
tively. On the left of (13/) stands a known polynomial of degree m-i utmost 
in u. It is then a familiar and fundamental theorem of algebra that A n -,1-1 
and /Id-i can be uniquely determined so that this identity holds. In conse 
quence A,, and pn are uniquely determined by the stated conditions. 
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More generally if we compare the coefficients of * ? on both »*»<*<“* 
we obtain a set of equations (.3.) related to (.3,) as the equahons (1are to 
) More precisely, the second subscripts 1 are changed to *, and in 
there appears on the right of the j'-th equation a further term A t t is 

known polynomial in the coefficients ' P‘‘-' us “ 

at once that 1>„. >*,. P». • ■ are uniquely determined in succession, and 

statement under consideration is established. 


§ 9. Third Reducible Case. 

Suppose that the difference equation L(y) = o has a single value of p, namely 
M = 0| and that the n roots of the characteristic equation are equal to I, so that 
the difference equation may be icritten 

(l4) L(y) cm + b x {x)J-'y + <•• + M*).V - o. 

1 

,ohere »,(*).... b,{x) are formal power series in negative ,w,cers of x" without 
constant terms. Suppose, however, that the u series 

xfc,(x), x*6,(x), ... **M*) 

do not all contain only negative powers of x> with or without constant terms. Then, 
unless the modified equation in z, 

z(x+n) + fc,(x)f (x+ n -1) + — + b n {x)z(x) - o, 

has a single (negative) value of p and all the roots of its characteristic equation 
are equal, the equation (14) is necessarily reducible, at least after the basic integer 
p is replaced by Ip. 

Before entering upon the proof, it is interesting to remark that, from a 
formal point of view, a difference equation of type (14) »» muc ^ ,,,ore c * ose l)' 
allied to an ordinary linear differential equation than is the most general dif¬ 
ference equation (4). This fact explains the usefulness of the difference notation 
employed. We note indeed that the above difference equation is to be considered 
as analogous to an ordinary linear differential equation with an irregular singular 
point of rank at most 1 at x = «. On this account we may expect a new type 
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of characteristic equation, analogous to that of the corresponding differential 
equation, to play a role. It will be our first step to introduce appropriate de¬ 
finitions. 

In analogy with the definitions of § 2, let us denote by b tJi the coefficient 
—it 

of the leading term in of the power series for 6/(2:), with 6 0 (*) M 1 and so 
; 0 = o. Furthermore, if Mx) w *re to vanish identically, the equation (14) would 
be obviously reducible, so that we may take j H to be finite. 



Now mark the n + 1 points (i t j t ) as in Fig. i, § 2, and draw the cor¬ 
responding broken line, concave upwards, whose vertices are chosen from among 
these points, while all of the remaining points lie on or above the line (Fig. 2). 
Evidently this broken line M starts at the origin with a positive slope, which 
is less than ]>, however, according to the hypothesis of the italicized statement. 

It is obvious that 31 is precisely the line L of § 2 for the modified equa¬ 
tion in In accordance with the italicized statement, we have to prove that 
when this line consists of more than a single segment, and even when it consists 
of a single segment but the roots of the characteristic equation are not all equal, 
the equation (14) is reducible. 

If the broken line M is made up of more than a single segment, let d < » 
be the integer which gives the coordinate * of the right-hand end point of the 
first segment. From the geometric construction it is clear that if we write 
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K =jjd so that x 
must obtain 
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is the slope of the first segment, the following inequalities 
j, £ x»\ (i ^ d)\ ji > xi, (i > d). 


We may write x in the fonr. m/l where the positive integers IZ i and m are 

relatively prime. . . 4 . „ . ,, . 

Thus the equation under consideration is essentially of the lorm 

m » »■ (*—n» . 

(l4 -) x~> Hg)~x> J'! + x ' + (o<*</>), 


where <!,(*). •.. «.(*) are power series in negative powers of x"\ among which 
r,,(i) at least contains a constant term while the series <*(*) for i > <1 contain 

no such term. 

In this case when the modified equation of the lemma in z has k > 

values of ,i of which the first is p,-x so that *>-#. the corresponding 

characteristic equation is 


U') + e, .(<?«'•) + +Cd0 = O. 


so far as the non zero roots are concerned. 

We shall show that such an equation is reducible if the basic integer is 
taken to be lp. x With this change of basic integer, x is replaced by ljd/d, an 
integer. Of course the relation x<]> continues to hold after this modification, 
where j> and x refer to the modified basic integer and modified x. 

Now let us introduce the difference operator J such that 


Ju = x p J u. 


We find then 

—» 

Ju — x* Ju 


and furthermore 

-- 1 

' \s a matter of fact the symbolic factorization accomplished only involves powers of jP 
in |)ic coefficients, so that the stated redocibility is effective for the original basic integer. We 
omit, however, the proof of this fact, which is easily made directly. 
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This gives immediately 

x p d'u = d'u + 0?»(x) d'u + ^’’(x) du, 

\ 

where 0W(x), 0^ ] (x) are power series in negative powers of x p without constant 
terms. 

Moreover we find that in general for m £ I, 

mu _ *~* 1 _ 

(, 5 ) x 9 d m u = d m u + 

i -0 

1 

where 0j m, (x) are definite power series in negative powers of x p without constant 
terras. This has already been demonstrated above for m—1,2, and is readily 
proved in general by induction. We have merely to observe that we have 


x > d m + l u = x 9 lx 9 d m {du] 



if the formula holds for the particular m under consideration, and to expand 
the terms involved. To this end we note that since 

d(uv) = m(x+ i )dv + (du)v % 


we have 


d(uv) = m(x+ i)dv + (du)v. 


Likewise we find 


d' (m t') = d(u(x + i)dv + (du)v) 

— m(x+2 )d*V + 2 du[x+ l)dv + (d S u) V 
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by another application of the same formula. Thus in general we readily deduce 
the identity 

(l6) >(«t) = «‘x + k)Sv + kj.b+k-1 )*-'» + - + W v - 

hich ia of familiar form. Applying this expansion, the formula (. 5 ) 

" h ‘ + , • the fact that the operator J applied to a power senes actually 

the degree of each term is to be borne in mind. Thu, (. 5 > i» ^Ushed 

^ ind By Ct u, n e of (.5). the difference equation M may evidently be given the 
form (14 ) : 

I x” l\y) = *■ ciM^-'y + + *•(*)» “ 0 

1*4 ) 

1 

here 5 (*) e,(x) are power series in negative powers of *' in which a con¬ 

stant term'certainly appears in HaW but not in any c t (x) [»' «><«■ More p«- 
ciseiy, it is apparent that the constant terms in c,(x). .. - <*(*) »™ ‘he same 
in Ci (x) . ■ • Cd(x) respectively- 

We propose to demonstrate that this expression admits of a symbolic factor 
of the form 

Uii) - + + J»Wjf 

where p,(*),.. - JmH are power series in negative powers of x> with the same 
constant terms a. c,(x).. - - *M respectively. To achieve this factor.zat.on, we 
shall set up an identity of the form 


(I?) "j £(,) - S-'(Uv)) + W) + + kn^xiUj,) 

I 

where A,(x). -. - A»-d(x) are to be power series in x” without constant terms but 
not otherwise specified at the outset.' In order to effect the comparison of 
both sides we need to expand terms of the type 


Note the formal analogy here and later with the method used in the preceding paragraphs. 
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and to do so we employ the identity (16) which gives the expansion 

y;,(.c+ n~d- i)z/"+ (n—d—i)./pj{x+ n - d-i - f/ + 

This leads to the explicit identities 


CiW*i>i(ar+n-rf) + A,(a:), 

r,(x) = p t [x + n—d) + (»i — d)Jp,[x + n—d— i) + A|(-r)p,(.r+ m— d— i) + l,(x), 

(18)... 

c,—i(x) ■ (n—d) r-*-'p a (x+ i) + (w-t/— i)l l {x) 4 m -*- 1 p d (x+ i) + ••• 

+ A,_d_i(x)y)«i(x+ i) + ./■-*'p 1 ,_,(x) + + ••• 

+ L-d{x)p.,-x{x), 

'"«(■*•) = + X x {x)J m - t - x p 4 {x) + •• • + l^(x)^(x). 


The close analogy of these equations with the equations ( 10 ) of § 7 should 
be noted. Since the operation J applied to any power series in negative powers 
1 

of X 1 ' lowers the degree of the initial term, it is clear that the coefficients of x >' 
on the right-hand sides can only involve the constants It), pi) for j £ k, and that 
the terms involving the operator J can never give rise to terms involving A/*, p, k . 
But, when these terms are omitted, the expressions on the right are identical 
with those on the right in ( 10 ). 

It is apparent that, because of the way in which A 10 , pto were chosen, the 
constant terms on both sides agree. Also, according to the preceding remark, 

— 1 -1 

the comparison for the terms in x* ,x* .... will determine Xn, pn, ht, pa,... 
uniquely, the equations of determination being precisely like ( 11 ,) for A-=i and 
(lit) for A-= 2 , 3 , ..except that is replaced by Ctj of course. 

The stated reducibility is thereby proved in the case when the broken line 
M consists of more than a segment so that the modified equation in z has more 
than a single value of /*. 

There remains then the case when there is but one such value of /t > — p, 
in which case it is necessary to establish that so long as the roots of the single 
characteristic equation for s are not all equal, the equation ( 14 ) is still reducible, 
at least if Ip is taken as the basic integer. 

30—29543. Aria mathrmotira. 54. Iniprimc I© 14 nvril 1930. 
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If we write Q = Qe* in this characteristic equation, it takes the form 


P + c.oe*” 1 + ••• + *»o=o, 


(CnO^O), 


and the equation in ~Q also has not all equal roots. 

Now in this case we can still adopt the preceding form (14") although here 
,he last coefficient «.(*) has a constant term not aero. Furthermore if e, is a 
root of the equation above of d fold multiplicity (d<»). we may again propose 
[ symbolic factorization (17' "here we choose 


P10 = - 1 ''' P '"’ _ ' 1 ’ 


and then i,„,... W so that the following identity holds: 

l«—P.V* + - + *•-*.) “ 

«- + e 1 ,«- 1 +- + e ... 

In this way the constant terms on the two sides in the coefficients of (17) arc 

made to agree. Likewise the comparison of the terms in 1» clearly yields 
equation, of the form (13.) "hich determine L,.p„ uniquely; more generally. 

the comparison of terms in (*> 1) leads to equations related to (13.) just 
as the equations (ill) are related to (11,). and so determine X„,p„, X„.p„, ■ ■ ■ 
in succession uniquely. 

Thus here again the desired reducibility is established. 


§ 10 . Fourth Reducible Case. 

If the equation L(y)=o may be written in Ike form 

+ M*)jr = o. 

where &,(*)■... M*) are power series in negative powers of x- with or without 
constant terms, then the equation is reducible with a symbolic factor oj degree I. 
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This case is analogous to that of an ordinary linear differential equation 
of Fucbsian type and the fact stated above is easily proved . 1 
If we substitute in this equation 

y = ^[yo + yi xP + + "1 

and compare coefficients of like powers of x, we find a set of equations 

/(*)*•- O, 

/(*- I) * + *•,= o. 


where we have written for brevity 

f(l) = 1 ( 1 - 1 ) (/-«+ l) + b l 0 W- 1 ) • ••(/-« + 2 ) + + bn o 


and where F t is a known polynomial in l, y 0 , y,,... yi-i for any ». 

But the polynomial / in / is clearly of the n th degree and has obviously 

at least one root l such that fare not roots. For such a value 

P V 

of /, it is evident that the first equation is satisfied for any y 0 , while y,,y„... 
are successively determined uniquely by the following equations, as soon as y 0 
is assigned, so that a formal series solution of this type is determined. However, 
if this series be denoted by y(x), and if we write 


y) = y(*+i)“ 


g(x- m) 

g(x) 


y(x) 


lg(x+ i) 
\ g(x) 



it is clear that we may express L(y) in the form 

L[y) - Mn- iU,(x,y)) + c(x)y 


where M n -i is a linear difference operator of order ti — i .* But if we substitute 

• See N. E. Norland, Differenzenrechnung, Chop. II. 

1 We have merely to employ successively the relations 

y(x+ n —,) - 1 "~ 2) * J,(X+V) 


in order to express L(y) in this form. 
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for y 


the 


series ,(*) we conclude c(z)j(*)-o, so that c(x) is identically sero. 
it is clear that Ely) is certainly reducible with symbolic factor 
nd'the proof of the italicised statement is thereby completed. 


§ ,1. Completion of Solution of Formal Problem. 

We are now prepared to resume the argument begun in § 6 . 

If an equation E of the type there specified exists, and if • /■• ore 

the corresponding values of we can at once prove that * must be ,. In fact 
J 4>li let be the greatest value of Make the tran.format.on 

ei* 

$• 

„hich is at once verified to yield a new equation of the same form with values 
of '* i.#*»—#*• 

of which one is zero and the others negative. By the result of § 7 e « nft - 
,ion would be reducible, and admit a symbolic factor LA’l) of lower order, having 
„t least one series solution ( 6 ). ( 6 ') shared of course by the equation E. Tins 

would be absurd. 

Hence the equation E has only one value of p, say wh.cl>, b) the 
transformation already employed leads to an equation E for which this value 

" f Now this modified equation E will fulfill all the conditions prescribed in 
the italicized statement of § 8 and so be reducible unless all of the roots of 
it, characteristic equation are equal, say to e,. The further transformation 

y =°tfy 

will then lead to a like equation E in which these equal values of g are replaced 
by i, since the effect of the last transformation is to divide the roots of the 

characteristic equation by g r 

Consequently we are led to an equation E of the form (14) whlch would 
fulfill the hypotheses of the italicized statement of § 9 or of § 10 and so be 
reducible, unless the equation in z has only a single negative value of say 
-z, with x<p, and all of the roots of its characteristic equation are equal. 
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However, in this remaining case, since the characteristic equation must 
contain a complete set of terms', a first conclusion is that x must be a positive 
integer and in addition that the equation under discussion has the form (14') 

where the coefficients c,(x) are power series in negative powers of X*, with con¬ 
stant terms as follows: 


’10 




m(m— l) 


I .2 


e;. 




where p, is the single value of Qe p . 

Suppose now we make the transformation 

in (14'). Evidently this transformation takes the equation K into another of the 
same type K. In order to characterize it in more detail, we substitute 

,/*y = + »V/g»(x+« — iM*" 1 # + • + U>lx))y 

for 1—1,2,... n and divide through by y(x + n). The equation obtained may 
then be written 


<p{x+ w) + x p c^J'-'ylx) + • • + X * c*(x) <p(x) j ij = 0. 


It suffices for our present purposes to observe that every coefficient so obtained 

j 

can contain no positive power of x p or even a constant term, and also that the 
coefficient of is given by a series in negative powers which starts off 

with a term of lower degree than -x!p. In fact we find readily 


w./g>(x + 11— 1) 
rp{x + n) 




x>' r,(x) 


rp(x+ n— l) 
9>(x+n) 


-«e,x 


>' + 


so that this is the case. 

* All of these terms must appear when all of the roots are equal. 
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But the equation E thus obtained has of course, for reasons already pre- 
nt€ d but a single value of u - and all the roots of its characteristic equation 

SeD a l ’ It wi u therefore be reducible unless it is of the exceptional type (14) 
eq hich the modified equation in * has the properties stated. Of course it cannot 
le of the type treated in § io, which is always reducible. Moreover, by the 
ument given above, the value of i for the equation in y must again be a 
Positive integer, and, on account of the nature of the coefficient of 'y, * 

!riU be at least one more than in the first equation. 

Thus, step by step, we are able to increase the integer x in the equation 
(l4 '), until at last we are led to the case x^p. But this has been proved to 
be reducible in all cases (§ 10). 

Hence such an equation E cannot exist. Thus we have proved the following 
fundamental result: 

Any difference equation (4) (or difference system (1)) in ichich the coefficients 

are formal power series in negative descending powers of x» (p£i), sueh that 
a 0 (x)*o, a H (x)* o (|a«j(z)|*o) admits of n linearly independent formal series solu¬ 
tions with elements of the types ( 6 ), ( 6 \ ( 6 "). 


§ 12. A Theorem on Keducibillty. 

Let .«,(*), jt„(z) be a set of n linearly independent formal solutions of 
an equation (4). It i» apparent that »,(*).... «.(*) are not determined at least 
up to a constant multiple of e"*’- 1 ' "here I is an integer, and that if anv sum 
such as 

+ •• + '***''' ~ l ' 5m[x) 

is of the type ( 6 ), ( 6 '), or ( 6 ") the set *,(*),.. • s m (x) admits of still further modi¬ 
fication. We shall refer to the complete collection S(r) so obtained as the family 
of formal solutions. Evidently it has the further property that if S(x) is in the 
family of formal solutions, so also is every determination of s{x) for the given 
basic integer p. Any subset of formal series of types ( 6 ), ( 6 '), ( 6 ") with this 
double property will be termed a ’natural family* of formal series, and the number 
of linearly independent elements in the family will be designated as its order'. 

We propose to prove the following 
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Theorem: To any decomposition of L(y) of order n into irreducible symbolic 
factors L lt L t , ... Lt so that 

L sb LkLt-i ... L t , 

there corresponds a sequence of natural families of formal solutions I'\, ... Ft each 
containing ihe preceding as a sub family, but such that there exists no further inter¬ 
mediate natural families, and such that the general solution of L x (y) = o ts furnished 
by F lt of L,(L,(y)) = o by F t , etc. 

Conversely to a set of families of formal solutions F lt F t ,... Ft, each con¬ 
taining the preceding, but in such wise that there exist no further intermediate fa¬ 
milies, there is a corresponding irreducible factorization which is essentially unique. 
Let us establish the first part of this theorem. 

Evidently L,(y). L t L,(y) = o,... L(y) = o define a sequence of expanding 
families F lt F t , ... F*. But if there exist* any intermediate family F* between 
F/-i and F( for instance, the equation L*(y) = o admit* Lt -*... L,(y)«=o as a 
symbolic factor. In fact we can write 

L*(y)- 3 /L,_, ...L,(y)+ Q[y). 

where Q is of lower order than Lt-x . . • L,(y). But we substitute any member 
y of Fi-x for y in this identity we conclude so that Q must vanish 

identically. Similarly we can prove 

LiLt-x . • ■ L,(y) - NL*[y). 

Therefore we have 

L[y) ■ LtLt-x ... Lt+xNL*[y) 

= LtLt-x ... Li+x NMLi— i ... L,(y), 


and it follows at once that 


LtLt-x ... Lt+iiNM-LiLi-i ... L,(y)«o. 

Hence we prove successively that the factors L*, Lt- i. • •• Lt+x on the left may 
be removed so that 

(NM—L,)L^x ... L,(y) = o. 
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„ , were not identically aero, we conld continue this process and finally 

If , d Jmo which is absurd. Consequently the factor U must be identical 
C °“ p rodu ct NM and so not irreducible, contrary to our hypothesis. 

This establishes the first part of the theorem. 

To prose the second part of the theorem we set up the equation AW- ° 
with the family f, aa its general formal solution. Obviously £, is determmed 

to a multiplication power series in/'. Likewise we let P«tv) = ° be the 
equation with genera, solution Ft tori-, k. We can then readily estab- 
,ish that L, is a factor of P„ P, of P„ and so on, w.th P. = L. 

p,mL,L„ P,-L,P„...L-UF^i. 


It follows that L is expressible as a symbolic product 

L->L l Pt-i , "L t L»-iPw" — LrLi-i Li 

To complete the proof that to such a sequence of natural families correspond 
irreducible factors L„L,,...U it is only necessary to observe that if any L, 
is not reducible we obtain further intermediate families, contrary to hypothecs. 

This theorem may be regarded as fundamental for all questions concerning 
reducibility of a single equation (4) with a given basic integer p. As an illustra¬ 
tion of this fact we cite the following two special results: 

A necessary and sufficient condition that L{y) = o of order n he completely 
reducible (i.e. expressible as a product of n symbolic factors) is that there exists no 
anormal series solution ( 6 ') with !>,. Every irreducible factorisation will then in¬ 
voke n factors, each of the first order. 

A necessary and sufficient condition that L(y) = o of order n> 1 be irreducible 
is that the complete set of formal solutions consists of a single solution ( 6 ') with l = n, 
and Us various detenninations. 

The necessity of the condition of the first statement is obvious, since in 
the expanding set of families F kt we necessarily add all of the determina¬ 

tions of series of anormal type when we add a single one of course. The suf¬ 
ficiency may be seen as follows. At the »-th step the family F<-x is expanded 
by the addition of at least one new member of normal type. This may be ex¬ 
pressed in the form (see ( 6")1 

s(x)(log:r)« 4 - t(x) (log x)* * + - + '*(*) 
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with m^o. If wi=0 (i. e. tbe solution doe9 not involve logarithms), it is evident 
that the addition of this new member leads to a larger natural family of order 
one greater, which must then be Ft. But if m>o, the solution 

ms(x)(logx)"~ I + (m—il/lxlllogi) - "’ + ••• + r(*) 

must be in otherwise it would suffice to add this last solution only with 

its various determinations to form a smaller Ft. But if these are in Z* 7 -i, since 
/= i we obtain a natural family of order one greater only, which must of course 
be F t . Thus the irreducible factors will always be of the first order if there are 
no anormal series. 

The truth of the second statement is obvious. In fact the family of all 
formal solutions not containing logarithms yields a factor of L(y) = o unless 
there are no solutions containing logarithms. Thus for the completely irreduc¬ 
ible case we are restricted to the types (6), (6*) of series. But each such type 
and its various determinations yields a natural family and a corresponding factor. 
We conclude that there can exist one single series solution of anormal type with 
u, But in this case L(y) = o is evidently irreducible. 


§ 13. Equivalence, Normal Forms and Invariants. 

There is a second method of decomposition of natural families of series 
solutions which is of importance in dealing with the notion of equivalence, 
namely that in which it is sought to break up the given natural family into a 
maximum number of entirely distinct natural families. Inasmuch as the formal 
significance of this process is most apparent when we use a linear system (1) 
instead of a single equation (4), we introduce the matrix notation at this point. 

In matrix notation we may write the system (1) in the form 

y(x+i) = ^(x)y(x) (MMI*o), 

where }'=6’(./-) will be taken to denote the matrix of formal solutions, one in 
each column of S(r), which are such that |S(x)|^o. 

Now suppose that we have a second equation 

f(r+.)=iwy(*), 

31 — 29643. AtUi maOtnruUiea. M. ImprimA I© I mii 1930. 
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Which is obtained from the first bj a linear transformation 

y(i) = B(x)r(i), 

so that __ 

A(t) = B-‘U+ iMW«W (|A(x)|s*o). 

lf the elements of B[x) are power series in*”' (p, the given basic integer) we 
hall say that the equations in I' and 1 ' are properly equivalent'. But .f the 
quations in Y and Y are only properly equivalent* after p has been replaced 
I! a suitable multiple of p t we shall merely speak of the equations as improp¬ 
erly equivalent*. We propose to deal here only with the latter type of equi¬ 
valence. Evidently the relation of equivalence in either sense is transitive. 

It is clear that equivalent systems possess formal solutions which are of 

essentially similar type. ... , , 

For the purpose before us we group the formal solutions s,(s) according 

to the exponential factors 

#•/ 

which occur in the series for each particular solution. Evidently all the solu¬ 
tions then fall into a set of entirely distinct families, provided that we regard 
two factors as essentially the same if the constant ,« is the same in each and 
the two polynomials P(x) differ at most by a term 2 ltt\ -ix (l, an integer). 

Aside from this factor, we have in any particular set of this kind a linear 
family of precisely the same type as presents itself in a linear differential system 

with a regular singular point at x = ». 

On this account, we can list the formal solutions in entirely distinct families. 

each of which has the following form 

s,i(x)m logx + f<i(x),... fti(*)mlog x + t„[x). 


ft! (log*)* + + «*||(X), - - . S.lM(logx)" + — + 

in which ft,(*),... *«(*), *«(*)•• •'«(*).--• all involve precisely the same expo¬ 
nential factor and »i£o indicates the degree to which logarithms enter. Fur¬ 
thermore for all of these elements the constant r may be regarded as the same. 
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Suppose that there are k ^ i such families with complete exponential factors 

which we designate by e,(x) t ... e t (x) respectively. If we denote by 3 ijt w t) 
the parts of the respective series s,j, t,j, ... to tJ in such a family aside from the 
corresponding exponential factors, say e,(x). it is clear that the matrix of formal 
solutions can be expressed as a product matrix of the form 


S,| fn ... 5 (l ... 


e, (x) e,(x) m log x ... e, (x) (log »)"• 



o e,(x) ..■«»(*)(logx)**" 1 ... 

All • • • 3h1 ... 


oo ... <*(*) 


which may be written B(x)E(x), where B(x) is made up of elements such that 
the linear transformation Y(x) - B(x) Y(x) takes the given equation into an 
(improperly) equivalent equation. But this equation in Y has E(x) as matrix 
solution so that we have 

A(x)-£(x+i)i w| (*)- 

Hence we are led to the following result: 

Theorem. An arbitrary system (i) of this type is (improperly) equivalent to 
a normal system 

Y(x+ r) =■ 2 (x) ?(*) (!(*) = E{x+ ■)£-'(*)), 

in which the matrix E[x) has elements o except for square blocks along the diagonal 
of the form 

e[x) e(x)m log x ... e(x) (log x)" 
o e(x) ... etxjtlogx )"- 1 


where 


oo ... e(x) 


e(x) = x* 



ip-. 

= yx + dx ,p 
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. i, an integer. Furthermore with any e(x) and m there are alto l -1 blocks 
he olher determinations of e{x) and with the same m in ease I > I. 
Evidently the functions efcr) and the integers m are to be regarded ae in- 
ariantire characteristics. In other words in order that two systems (.) be 
r oroperly) equivalent it is necessary and sufhcient that they possess the same 
let of functions e(x) and corresponding integers m, each with the same multi- 

Pl ' C ' t This theorem enables us to deal with most questions arising out of the 
uotion of equivalence. We cite merely the following two special results, which 

are readily proven: 

There a set of n distinct natural families of solutions if and only 

if the formal senes are of normal type and no logarithms enter. In this case the 
normal form is as follows: 

— *r e 

y,(x+ «)-(.+ *)’ * T J,(x) (j- I,... 


in ichich ore integers. 

No two distinct natural families of solutions can be found when and only 
when there is a single exponential factor e(x) which is not repeated, together with its 
various determinations. 


§ 14. Difference and Differential Systems. 

It has been stated above that in certain cases difference and differential 
systems are closely related from a formal point of view. If we bear in mind 
that the formal solutions of linear differential systems 

Y'(x)-B(x) Y(x), 

in which of course the elements of B(x) are formal power series in descending 

powers of x>, are the same in type as those of Unear difference systems (1) for 
which p = o, we conclude that the matrix S(x) of formal solutions of a Unear 
difference system is also the solution of a linear differential system if every pi 
is o. If we fix the arbitrary exponential factors e 11 **~ Xx in these formal solu¬ 
tions, this corresponding Unear differential system is uniquely determined, since 
we have B(x) = 5'(x)5 _1 (x). 
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The fact that the polynomials P,(x) which enter are of degree not exceeding 
i in x indicates that the corresponding differential system has formal solutions 
which are of rank at most i. 

Thus we obtain the following theorem: 

Theorem: A necessary and sufficient condition that a linear difference system ( i ) 

Y[x+ i) - A{x) r(x), 

in ichich the elements of A (a-) are descending formal pouer series in x p , be form- 
ally compatible with some linear differential system 

Y'[x) - B(x) Y(x) 

(i.e. that both admit a common formal matrix solution S(x)) is that the values of /« 

in the formal solutions of the difference system are all o. Here the elements of li(x) 

\ 

are also power series in descending powers of x p , and the folloicing formal identity 
holds: 

A'(x) = B(x+i)A(x)-A(x)B(x). 

Conversely, if a linear differential system of this type admits of formal solu¬ 
tions which are all of rank at most l, it will be compatible with such a linear 
difference system, and the same formal identity trill hold. 

The formal identity is obtained by noting the relations 

6"'(x + i)-£(x+ i ).S(x+ i)=* B(x+ i)A(x)S(x) 

and at the same time 

S'(x+ i) = A(x)S'(x) + yl'(x).V(x) 

= \A(x)B{x) + A'{x)\ S(x). 

As we shall show in our second paper on irregular difference equations, 
an actual linear differential system of this type is not only formally but also 
actually compatible with a difference system, so that the solutions of the differential 
system are also solutions of the difference system. 

Hitherto we have taken the given difference system in a form involving x 
and x+l, rather than x and x + d. The question arises as to when two linear 
difference systems with distinct values of d, say d, and d t , can be compatible 
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in the form al sense specified above. Since the formal series in snob eqnat.ons 

nee seen to involve exponential factors A it is clear that at least 
Me “ 1 ratio djd, is not a rational number, every value of p must be o; 
r that a itself is either integral or rational. Hence in this case the two 
systems are both formaUy compatible with a Unear differential system 
r, the other hand if the ratio djd, is rational, and if d denotes the greatest 
“ on submultiple of d, and d„ so that ^and i, are 

relatively prime integers, we may find integers /„ f, such that «■- 

Consequently we may express *(.+< as A [x )Y { *) by repeated use of the g,ven 
difference equations. Thus we are led to the foUowing results: 

Two linear difference systems 

Y(x+d t ) - A x [x) Y(x), Y(x + d t ) = A t (x) Y(x) 

are formally compatible in the case ,ehen d,ld, is not a rational number if and only 
if they are both compatible ioith a linear differential system 

r (x) = Blx) Y(x) 


of the type referred to above. Here the following formal identity holds: 

, 4 ,(x + rf,)A,(x) = A,{x+d,)A,(x). 

In case djd, is rational and d is the greatest common submultiple of d, and d„ 

these systems are formally compatible if and only if .re have 

Ajx)mA[x+d,-d)A[x+d,- 2 d) A(x), 

A,{x)~ A{x+d,-d)A(x+d t - 2 d) A(x). 

We defer to our second paper all discussion as to the relation between the 
formal situations considered in the last three paragraphs, and the corresponding 
analytic situations. 
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§ i. Introduction. 

§ 2. On B 1 and proper curves. 

§ 3. Lemmas on iteration. 

§ 4. A lemma on summation. 

§ 5. Construction of proper solutions to the right of a proper curve. 
§ 6. A lemma on factorization. 

§ 7. Products of completely proper operators. 

§ 8, Completion of the proof of the Theorem of § 7. 

§ 9. The Fundamental Existence Theorem. 

§ 10. Connection between ’upper' and ’lower' solutions. 

§11. The converse problem. 

§ 12. The related Riemann problem. 


§ i. Introduction. 

In the present paper we study a linear difference system of order n in 
ft (*).•• •*.(*), 

(0 y,(x+i)-2 *>(*)»(*) 

‘ (|(«*j(*))|*o; »'= i, 2,... »), 

1 National liesearcb Fellow, Harvard University. 

1 — 32611. Arm matkrmatira. 60. Impriip* l« I *ept*ml>r® 1032. 
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in which the coefficients *,(*) will be taken either as known rational functions 
of * or, more generally, as series convergent for |x| > r which, except for a 

finite number of ascending powers, are in descending powers of x or of x<- 
(p, a positive integer). Still more generally it may be supposed that the coeffi¬ 
cients are merely represented asymptotically by such series in certain regions of 
the complex plane. 

The equations (i) will be written in the matrix form as follows 
(i a ) F(*+l) — A[x) 7 (x), 

(7(*)-(*,(*)); A (*) ■ (*))); 

here, for j — 1,2,... n, the elements yij(x), • • • y-j{x) in the y'-th column of 
I r (a) form a solution 

y, (*) - yu(*).• ••*■(*)- 

of the equations (i). Such a solution Y{x) of the matrix equation (i a) will be 
called a matrix solution in case | Y(pe )| ** o. 

In a preceding paper by Birkhoff* the weU known fact was pointed out, 
that such a system (i) may be related to a single difference equation of the 
n th order 

(2) L.(y)-.a 0 (*)y(z+n) + a,(x)y(a; + n-i)+ - +o,(x)y(x) = o, 

(ao(x)^o; a„(x)*o) 
by means of a linear transformation 

(3) yM -2 

J-i 

in such a wise that whenever y,(x), ...y.(x) is a solution of (1), the correspond¬ 
ing y(x) obtained from (3) is a solution of (2), and, conversely, whenever y(x) 
is a solution of (2), then the n functions y,(x), ... y„(x) determined by the n 
equations 

1 Formal Theory of Irregular Linear Difference Equations, these Acta, vol. 54 , 1930 , pp. 
205—246 (cited hereafter as (I)). 
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(4) 


n 

y(x+ i)=2^U+ Oaji.toy^x), 


ft 

y(x + n - + ” ” + *-*)••• fl/.-iJ.-iMWe-iH 

.Jft-i 

form a solution of (i). Here ^(x), ... a„(x) are known functions of the same 
type as the a,j(x) while 2,(x) _M*) are k® 0 '™ rational functions of x, ar¬ 

bitrary except that certain special conditions are not to be satisfied. Conversely 
of course an equation (2) can be related to a system (1), for instance by writing 

( 5 ) y,(x+i) = y,(x), y,(x+i) = y,(x).y-»(* + 1 ) - y„(x), 

a 0 (x) y„ (x + 1 ) * - a, (x) y„ (x) - a, (x) y „-1 (x)-a« (x) y, (x) 

in which case y = y,(x) will satisfy (2). Since in (2) we have a 0 (x)*o, without 
any loss of generality it may be supposed that o^x)™ 1. In much of the text, 
along with an equation of type (2) there will be occasion to consider the 
related system 

(6) Y(x+ i) = D(x)r(x), 

( o, 1, o, ... o 

... 

-On, —On- 1, . . . 

If }' (x) = (y<> (x)) is a matrix solution of (6) then 

(6 a) lyu(x))* to (* + *—1)) 

and the functions y, (x), . . . y«(x) will constitute a fundamental set of solutions 
of (2). The converse is also true. 

The fundamental result of the paper referred to above is that every system 
of type (1), or single equation (2), admits precisely r< linearly independent formal 
solutions with elements of the general type 
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p-, t 

(7) ««'*'»(*). Q(z) = px log x + yz + dx' + - + *x> 

where 

s(x) = af[(a + bx~i> + •••) + (a 1 + b x x~ > + ••-) log x + ••• 

(7 a) _i 1 

+ ( fl " + 6-x p+ - )log-xJ, 

p is a positive integer, pp is an integer and m is a positive integer or O. Here 
p does not need to be the same as the integer, denoted by the same letter, 
occurring in connection with the coefficients of a system, or single equation. 
The following definition will be introduced. 

Definition 1 . A formal series s{x) which is of the form (7 a) will be termed 
an s series. 

An element may be thought of as representative of y(x) for in¬ 

stance, in which case the corresponding yjx),... y M (x) are given by such 
s series with the same exponential factor Two series (7) which differ merely 
by a periodic factor (X, an integer) are considered to be linearly 

dependent. The series involved may converge but in general will diverge. 
With an equation of order n, of type (2), there will be associated w functions 

For purposes of classification the following terminology is found to be 
convenient. The difference system (1) or single equation (2) will be called nor¬ 
mal if p- 1 in all of the formal series, so that each Q t [x) reduces merely to 
(ijX log x+yjx; otherwise it will be called anormal, since then there enter anor- 
mal series with p > 1 . This agreement is in accordance with that used for 
linear differential equations. Moreover the system (1) or equation (2) will be 
called regular or irregular, according as there is only a single value of pj or 
more than one such value. Finally any difference system (1) or equation (2) is 
called singular when it is not both normal and regular. 

The earlier methods of Norlund, Galbrun, Carmichael and Birkhoff were 
applied primarily to the regular normal case 1 ; for a system (1), this case may 
be looked upon as the ’general' case from a certain point of view. 

1 Cf. N. E. Norlund, Difftrtnztn Rechnung, Berlin ( 1924 ). Sec also Birkhoff s papers Ge¬ 
neral Theory of Linear Difference Equations, Trans. Am. Math. Soc.. vol. 12 (1911) pp. 243—284 
(hereafter cited as (II)); The Generalized Riemann Problem for Linear Differential Equations and 
the Allied Problems for Linear Difference and q Difference Equations, Proc. Am. Acad. Arts and 
Sciences, rol. 49 (1913X PP- 521—568 (hereafter cited as (III)). 
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Furthermore Galbrun 1 has treated a single difference equation of order n 
with rational coefficients in the special case of the special anormal regular type 
in which a pair of anormal series enter with p = 2, so that the two correspond¬ 
ing polynomials Q[x) in V~x have the respective forms 

y(x) + d V~x, yx — i Vx. 

In a recent important paper Adams* has shown that to some extent Birkhoff's 
methods continue to apply in the irregular normal case. 

In the present paper the analytic theory of linear difference system (or 
single equation) is developed so as to apply without restriction upon the form 
of the formal series. The methods consist, on one hand, of suitable modifica¬ 
tions of those of paper II; on the other hand, an important rdle is played by 
certain new methods involving factorization and group summations. The main 
result of the paper is embodied in the Fundamental Existence Theorem of sec¬ 
tion 9. In most of the text preceding section 8 we restrict ourselves to a 
quadrant /', 

~ ^ arg x £ n + 4 (|*|>$>o) 

the lower boundary of which, h, is a portion of a line parallel to the axis of 
reals and lying below this axis. In quadrants other than r results will hold 
precisely analogous to those obtained with reference to J\ 

As a matter of notation we shall write 

Qu(x)- Qi[x)-Qj(x). 

Moreover, in addition to definition (1), it will be found convenient to set 
forth the following definitions. 

Definiton 2. A branch extending to infinity and satisfying the equation 
31 Q’, j(x) = o will be called a If curve. If there is no B' curve. 

Definition 3. Let G denote a part of r 1 cith the right boundary coincident 
with that of T. Let the left boundary of G have a limiting direction at infinity; 

1 H. Galbrun, Sur cer faints solutions eiceptionnelles d une Equation liniairt nux dyfftnncts 
finite. Bull. Soc. Math, de France, vol. 49 09 *»X PP- 206-241- 

* C. R. Adams, On the Irregular Cases of the Linear Ordinary Difference Equations, Trans. 
Am. Math. Soc., vol. 30 (1928), pp. 507—54. 
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if this direction is coincident with that of the negative axis of reals assume this 

boundary to be of the form v = h[-uY(h>o-, l>e> 0 ; x-u+V-tv). Let C 
denote a curve in G, with a limiting direction at infinity. V(x) will be said to 
possess an order k 0 along C if , as |a|-»oo along C, 



*<*o 
*>*o- 


A function V(x) will be said to be proper in G, or |« K «| will be said to be com¬ 
parable with \e lnV - ix \ in G, if along every curve C, lying in G and of the above 
description, V[x) has an order k 0 (in general, dependent on 0 ). A set Qj(x) 
(j = i will be said to be proper in G if all the Qij(x) (*, j = I. ••• w ) are 
proper in G. The region G may reduce to a single curve C. 

Definition 4 . An operator Ln(y) (or equation JU,(y) = o), with coefficients 
known in a region G and of the type, in G, which has been assumed with reference 
to (i) and (2), will be termed proper in G if the equation L„(y) = o has a funda¬ 
mental set of solutions with the asymptotic form of the formal senes in each of the 
several regions, separated by B' curves, which form G. Solutions of this kind will 
be said to be proper. 

Definition 6. A function p(x), of period unity and analytic in an upper 
half plane, will be called proper if 

(p, a constant; H, an integer) 


in a region whose left boundary is of the form v = h(—u) t and whose right boundary 
is of the form v = hu r (h > o, e > o). 

Definition 6. An operator L* (y) (or equation Ln(y) — o) which is proper in 
G will be called completely proper in G if, in G, proper fundamental sets of solu¬ 
tions exist which are connected by proper periodic functions . 

Definition 7 . A set 

Q>(x). ■ ■ <?>(*) 


has a point of division in G if this set can be separated into two groups 
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Q,(x),... Qr(x); 

(issrc.) 

SO that for x in G 

9W to *91 ?/•♦,(*) 

[X - l,... r; n =* I,... n-t). 

Definition 8. Let G denote a subregion of r, the lower boundaries of G and 
r being coincident. An operator L„(y) (or equation L n (y) = o) with coefficients of 
the same kind, in G, as in (i) and (2) will be said to be Q,factorable in G if the 
set of Q's, 

«.W. ... <3„(z), 

belonging to L n {y), has a point of division in G. 

Definition 9 . Let F be a curie extending to infinity and lying in I. Let 
Rf be the portion of V between F and the right boundary of V. The curve l and 
the region Rf will be termed proper for the set 

if this set is proper along F and also in Rf- 


§ 2. Some Properties of If and Proper Cortes. 

Let us write 

(1) Qj[x) = ttjX log x + Pj(x) (j=* >. •••*)• 

With a system (or equation) of order n we have associated n polynomials, 

1 

/^(x)l;= 1, ... n), in x p t of degree not greater than p and without constant 
terms. These polynomials occur in the exponential parts of the formal series 
17, (7 a); § 1). In this connection subscripts 1 to n are attached to p, m, r, y, 
d, ... v so as to differentiate between the n exponential factors which accompany 
the n formal power series (when we consider a single equation) or so as to 
differentiate between the n columns of the formal matrix (when a system is 
considered). Unless stated otherwise, we shall take for each of these, elements a 
common value of p and m; this can always be effected by taking these integers 
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sufficiently great. It is obvious of course that if one determination of x>'J 
and of log x is made in such a formal solution, all of other determinations of 


x v J , namely 


1 L L 

utX p J, io*x p J, . . . o)*j- l x>'j 


where a, is a primitive pjth root of unity, and certain of the other determina¬ 
tions of log x, namely 


log x + 2 rtV— 1, log x 4-4 rtV^ 7, ... \ogx+ 2 {mj-l)nV^ 


yield other related formal solutions. Thus with one such formal solution is 
associated a group of mjpj solutions which are linearly independent. 

For the present we suppose a cut made along the positive axis of reals 

i 

and take the principal determination of x p and log x on the upper side of the 
cut. In much of the text we deal with solutions the left’ in such a cut plane. 
When we 'work from the right’ there is a similar procedure, with a cut made 
along the negative axis of reals. 

The B' curves will be seen to be of outstanding importance (Def. 2; § i). 
Along such a curve 91 <?',;(x) = o; conversely, any curve for which 91 < r / ( ;(x)^o 
(/>j) is a B' curve, except when p, = pj while P'ijix) is a pure imaginary con¬ 
stant. In fact, the equation 


( 2 ) 


9 i<?'u(*) = o 


can be written more explicitly in the form 


^(log \r\+ l) 4 91 Ptj (x) = o 


= Yu + 

this justifies the last statement. There is no B' l j curve* when u,v*pj, since for 
|x| large the first member of the above equation would be arbitrarily large 
while the second term would remain finite in fact approaching 91 ;',;. Conse¬ 
quently it follows that for the existence of a curve it is necessary that 


P~ i 


iijx 


i 


Pa = Pi— Pi, Yu = Y*- Yi, 0 ,J 




1 The W curve along which <?',;(*) = o is denoted by K f ■ 
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Obviously the F curves are algebraic. Thus when a B\j curve exists it is 
given by the equation 

Rite) ™ 91 (P'o(x) - YiJi = o 
or 


(2 a) 


•[*? 


hiX 


1 

*• + 


+ 'y„x ( ' )] - 


If there exists an anormal series (with p, > 1), the p, determinations of 

\_ 

x p i yield p t associated series solutions with the same value of p and the same 
real part of y; in this case the necessary conditions for the existence of a B' tj 
curve are satisfied. 

If pi = pj= 1, there exists no Ft) curve since then Ft)(x) is a pure im¬ 
aginary constant. It is clear then that if a B\) curve exists it satisfies an 
equation 

(2b) + ••• + 

(VU= rmy^o\ \£9&p-\). 


The leading term in (2 b) is 



\c = arg 1); 


consequently there will be several B' t j curves with limiting directions a defined 
by the equation 



It is of interest to determine in what cases there are no F curves. From 
the preceding it is clear that whe must then have for all t and j either 
Hij ^ o or pij = o, SRyij j* o, or else ptj = o, 91 yij = o, Ft) — yt) = o. Suppose 
then that we consider those formal series for which pt has a given value and 
at the same time 9 iy, has a given value. For the corresponding formal solu¬ 
tions we shall have 

2 — 32811. Arm mathrmalira. 60. Imprim* lc I *ept«*mbre 1032. 
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Now if Pi(x) has various determinations, then m and are the same for all 
of these, so that these determinations fall into the same group, and P'tj-yij&o 
in all cases. Consequently !*<{?)—yt must be identically zero, so that P,(x)^y,x. 

The case in ichich there are no If curves is accordingly the general normal 
case, regular or irregular. 

The If curves are important for the analytic theory since they limit tue 
scope of validity of iterations and summations. 

In the cases hitherto treated with success there have been no B‘ curves. 

Generally speaking the B’ curves, or any other curves depending on the 
g(x)‘s are not to be regarded as fixed up to a translation. The region T 
consists of several consecutive regions separated by B' curves. These regions 
will be denoted as 

(i). (2), ... (m), ... 

where (1) is the region having for its lower boundary that of V. Unless there 
is only one such region, so that (1) is the upper boundary of (1) will be 
denoted by B'\ in general, tne right boundary of (m) will be denoted by If". 
The number of these regions is finite, the right boundary of the last one of 
them being coincident with that of T (i.e., with the positive half of the axis of 
iraaginarie 8 or a line parallel to that axis). Moreover, these regions may be 
considered overlapping in the sense that any such two consecutive regions may 
be considered as having a strip of, say, unit width in common. In the sections 
leading up to § 9 it will be assumed that If curves are simple in the sense that 
as tve pass across such a curve precisely two of the functions 91 Q' (x) arc inter¬ 
changed in order. How to meet the situation when the above is not the case 
will be apparent from the text. 

The following lemma will be now proved. 

Lemma 1 . Let B" be a If t j curve with the limiting direction of the negative 
axis of reals. Assume that it is not coincident with the negative axis of reals. 
Then 

( 3 ) Qut*) B V“ 1 (<*ijx+ atj{— as)** + - *) p + ■) 

(<7, a, ... b, ..., real; a, b ^ o; p > s t j > l'tj £ 1) 
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and the equation of B m will he of the form 

(3 a) * = <?(-*)' + ••• 

(x = u + V— 1 v\ dtj ™ p — »t) +. Hji 00). 


Moreover , in the region between a curve Fr tJ , 

r jt 

(3 b) „ = y(_,,)P (y > o) 


(lying, of course , fee/ow B m ) and the positive axis of imaginaries (o$ well as further 
to the right) the function Qij(x) is proper. 

{Vi 

Proof. Writing Qij(x) in the form (3) we note that a term in a;* with 
P> r^i actually enters, since otherwise the negative axis of reals is evidently 
the only possible corresponding & curve; this possibility is excluded by hypo¬ 
thesis. Hence bgy^o. The equation of the curve will be of the form 


MQ'tAx) 


* 9 i V^T 


'll 

*/<*«(— x) p 
p(- x) 


) 


+ 


/ ^ 

r 0 M —*)’ 

\ p(-x) 



o. 


If s,j< r,j the dominant term in the parenthesis is clearly the second one 
of those displayed. The expression in the parenthesis would be nearly real for 
|x| large. Hence there could be no B' curve of the specified type in this case. 
If st j = r t j the parenthesis has a dominant term 

'll 

y — Stjag + r tJ bij \ ( -g)* 

P P I (“*) 



and this is also impossible since btj^o. 

There remains the case S(j>r t j with aij 7*0, when the dominant term 
is the first explicitly written. Here there is actually a B' curve with horizontal 
direction to the left. Now we may write the above equation in the form 
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(% l 1 —\ ^ - a h (r) a — ■ — 


-f- 

1 ■ 

** 

rtj h 

j + 

)1 

%h(x) — o, n\x j- p-.,* 

(-*) p ~ 

\ P 

•ij-’lt 

( - x) p 

1 


Along the curve, for |x| large, h[x) will be arbitrarily near to a small real 
value. Furthermore by suitably ordering * and j we may make a,j negative so 
that A(x) wiU approach a positive value. Introducing a new variable x, 


f-'n P—*\ 


sit an 


\—x) p (- x) 


] 


h{x) becomes 


—W,[' + / -'(*■<- * ] ’ r ‘ ,+ )1 {,l ' 7 ‘ 0) - 

<-«)" 

In the last parenthesis the power series has real cofficients. By inversion it 
follows that 


r •u- r u 

+ V~=~l d t h(x)r-‘i + —J 

Hence, writing & — ii + V— i 0, we have 

-9 

so that the B’ curve in the x plane will have the form 


(<l t 7*o). 


d JI 

y = u) p + •••(</, 7 *o; p>d tJ -= p-sij-¥ 1 'ij^i). 


Consequently the equation in the x plane will be of the form (3 a). 

We shall prove now that the function Qfj(x) is proper in a region By,.. 
between a curve Fr tj , given by (3 b), and the positive axis of imaginaries (as 
well as further to the right until a line, in the first quadrant, is reached making 
an arbitrarily small angle with the positive axis of reals). In other words, we 
wish to show that along any particular curve, lying in Rf F( . and having a 

limiting direction at infinity, a definite finite order k 0 (depending on the curve) 
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for e Q U {x) as compared with ,x exists in the sense that, as |z| becomes in¬ 

finite along this curve, we have 


( 4 ) 





(*>*,) 

(*<*.)■ 


(If we work below the real axis, is replaced by It can 

be easily verified that, since in Qij{x) the constant fi,j is zero, Q,j(x ) will be 
proper in the whole plane excepting possibly in two arbitrarily small sectors 
containing the positive and negative axes of reals, respectively. In other words, 
Qij(x) will certainly be proper in any region in which there can be no curves 
having limiting direction at infinity, coinciding with that of the real axis. For 
simplicity of demonstration the subscripts i,j in the following will be omitted. 
It is sufficient to consider curves Ft lying in Rp r and given by equations of 
the form 

(5) v = h(-u)’ (A>0; p> d^ r)\ 


here 6 is not necessarily an integer. 
For x on Ft we shall have 


n — a = tg -1 (~) “ ~ W 

<-«) ' 


so that, along Ft, 


(5 a) 

and 
(5 b) 


91 (^ - I (-x)*') = \x\* sin (n - a) = P p _ i + • 


r r 


91 ( — x) p —\x\ p cos ~[n—a) — \x\ p + •••. 


Using (5), (5 a), (5 b) and the fact that, along Ft, \x\ ■» — n + ••• we have 
for x on F4 


(5 c) 


i Tehfl 1 

91 (^(z) -2;rA) / -iz] = t2A.'r-o)A(-M)^+ (-u) " + J 

+ [ 6 (_ l 4 )e+ ...]; 


572 



14 


George D. Birkhoff and W. J. Trjitzinsky. 


here the terms displayed in the parentheses are correspondingly the leading 
ones. When 6 = 1 ' the leading term in the second member of (5 c) will be 

r 

((2 kn — a)h + &](— u)*\ on the other hand, when 6>T the leading term will be 
s 

(2 kn- o)h(— u) p . In the first case the order k 0 will satisfy the equation 

(2 k 0 /t — o)h + b = o 
and in the second case it will satisfy the equation 

2 k 0 n — a = O. 

This completes the proof of the lemma. 1 
The following lemma will be proved. 

Lemma 2 . If in the region (1) + ••• + (m) the set 

©«(*).-•• Qr[x) 

has no point of division then this set has a proper curve F in (m) or further to 
the left (Cf Def 7; § i). 

Proof. This is obviously true, in any case when the limiting direction of 
the upper boundary of (1) + ••• + (m) is not that of the negative axis of reals. 
Accordingly we assume that the upper boundaries of the regions (1), ... (m), 

B\ B\... B m , 

each have limiting directions coinciding with that of the negative axis of reals. 
By hypothesis, if the set of functions <?(x) is separated into two groups 

( 6 ) &(*).-•• <M*h 

there exists at. least one member of the first group and at least one member 

of the second group which interchange order in (1)+- \-(m). This statement 

will hold true for s= 1,2, ... T— 1. Consider (6) with 5=1. Let Q k \ (.r) be 
the member with the least subscript which interchanges order with @,(a;). This 
would necessitate existence, in(i)+ - + (»n), of a curve. Obviously there 


It is sufficiently obvious that existence of sd order k e along F r insures the desired result. 
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If, 

could be no Bijtf = 2 ,... — 1) curves in (1)+ ••• + (w), but necessarily curves 

Bt. *!. Bs.kl, • • • *! 

exist in (1) + — + (m). On applying Lemma (1) it follows then that the func¬ 
tions Qi.k\{x) t are proper on and above a curve F in (m). 

Noting that 

0uW-0i«W+0uW UZUZK) 

we conclude that the functions Q^(x)(i £t, j£k\) are all proper on and above 
F. If there are any other ^(x)’s changing order with Q%{x) let 

«WK>« 

be the member, lying nearest to &l(x), which has this property. It follows 
that there is a Fi.iJ curve in (i)+ ••• + (»»), and that, in (l)+ ••• + (w), there 
can be no curves 

(#“ *. «-*:-«). 

Hence necessarily the following curves would exist in (1)+ ••■ + (m) 

£*l+i. and 

By Lemma (1) the functions 

0*S0*and 0i, t \(*) 

are proper on and above a curve F in (m) (F will now stand for the upper one 
of the two F-curves so far considered; we proceed in this fashion in each of 
the consecutive steps). Now 

Qtj (*)— 0#.»i(*)+0ftl.iW 

w+i 

so that the + * £ •*, j £ *i) are proper on and above F. We note, 

further, that 

Gfi(*)-flu!+.(*)-fluito + QtlM + 

Qi.*j(*)+ «*}.*!♦.(*) (iStSt:; ij + is jSk\-. 
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Here the terms of the last member are proper on and above F so that the 
above Qij{x) have the same property. In conjunction with the preceding it 
follows that the 

Q tj {x) (i*i,j£K) 

are proper above F. 

Suppose that <?*;(x), ... <?*;,(*)(;,£ 0 

( 2 -*:<*;•»<«) 

are all the Q(x)'s which change order with 0 ,(x) in (i)+ * * • + (m). By the 
reasoning of the kind just employed we can demonstrate that the Qij(x)(i £», 
j$k]) are proper above a curve F lying in (m). 

Consider (6) with s = k],. Let ^,(x)(d t > i) be the Q(x) of the first group 
which has the least subscript and changes order, in (i)+- - + (m), with at least 
one of the set of 4 >(x)‘s of the second group. Such a subscript 6 t (I < 6 t ^ A>.) 
certainly exists. Let all the ^(x)’s of the second group, having this pro¬ 
perty, be 

Qt](x), 4M(x),... l) 

w.+is. *:<*:<-<«. 

By the reasoning already employed and using the proved fact that the 

Qijix) (i £ j & H) 

are proper above F, we conclude that the functions 

Qn(xi (i Si.jZW 

are proper above a curve F in (m). Unless JJ = J\ when the desired result is 
achieved, we consider (6) again, with * = *£ and continue the process as speci¬ 
fied above (we have <J,>d*>d,= i). The proof of the lemma can be completed 
by induction and is seen to be applicable also when the B' curves are not 'simple. 
Another lemma will be essential for the purposes at hand. 

Lemma 3 . If the set 

(7) <?,(*),... <?„(x) 

has no point of division in (l)+••• + (»» 4-1) but in (i)+ • + (»») there is a point 
of division , then this set has a proper curve F in (m) 
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Proof. Suppose that the point of division in (i) + ••• 4 - (m) is between the 
/’• th and /* + i-st members of the above set. Since the set (7) has no point of 

division in (1)-*--h (m + 1) it follows that B *, the right boundary of (m), is 

a Br, f curve with, say, I' and, necessarily, pc? r+ 1. In fact, if we had 
r, q £ r or r, q ^ r + I the set (7) would have a point of division (between the 
T-th and V + 1 -at members) in (1) + --- + (m + 1). Now the set 

(7 a) 

has no point of division in (1) + •• + (m), the same being true for the set 

(7 b) Q rtl [x), ... Q,(x). 

The truth of this statement follows from the fact that B m ia B r . f (r ^ /"; 
Q £ r + 1) so that a point of division, in (1) + • • + (m), of either one of the 
sets (7 a), (7 b) would imply a corresponding point of division for the set (7) in 
(1 ) + ••• + (w + 1); the latter situation, however, had been excluded by hypothesis. 
Hence, by Lemma 2, the sets (7 a) and (7 b) are each proper to the right of a 
curve F lying in (m). In other words, we have the functions 

(8) (I SUSJ) 

proper to the right of l' (in (m)), and the functions 

(8 a) Q ti (x) (r + 1 £ 1, j Z n) 

also proper to the right of this curve. 

Consider 

(8 b) 4 M*)= V*. #>,(*) 

(iStST; r+i£j£»i; i£<7£n-r). 

Any one of the functions (8 b) could be written in the form 

Q<j(x) = Qi. /•♦.(*) = Qi. r (x) + Q r . ,(x) + Q f . r+<(x) 

(«. rzr, Q,r + ozr+ i). 

Now (Ji, r {x) is a function of (8) and Q fi r+o[x) is a function of (8 a); these two 
functions are proper to the right of F, in (m). On the other hand Q,, 9 {x) cor¬ 
responds to B m , the right boundary of (m). Thus on applying, if necessary, 

3 — 32611. Ada malfitmalioi. 50. Im|>rim6 Ip I wptrinhn- 1032. 


\ 
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Lemma i we conclude that Q r .,(x) is proper to the right of a curve F lying m 
(m). Consequently the functions given by (8 b) have the same property. 

Consider 

( 9 ) Qij(x) (iZijZn). 

Any particular Qij(x) of the set (9) belongs to one of the sets (8), (8 a), (8 b)‘; 
hence the functions (9) are all proper to the right of a curve F, in (m). This 
completes the proof of the lemma. By a similar, though slightly more complicated 
reasoning, tee. show that the Lemma is true also when B' curves which are not 
'simple' are admitted. 

Proper curves will be seen to be important since, as will be shown later, 
along paths lying in proper regions bounded by such curves, certain summations 
are possible. Such curves (and regions) are also essential in demonstrating that 
the periodic functions connecting certain sets of solutions are proper (Cf. Def. 

Si § I)- 

§ 3. Lemmas on Iteration. 

Consider the quadrant I* and the consecutive regions 

(l). (2),... (m), ... 

which are separated by B 1 curves, B\ B % , ... B m , ... and constitute /' (see § 2). 
On the other hand, there is a quadrant Q of a similar kind lying below the 
negative axis of reals and having for its upper boundary, (h l ). a portion of a 
line v = c > o. The consecutive regions of Q separated by B' curves will be 
denoted by 

(l), (2), ... W, ...; 

here (l| will be the region having (A 1 ) for its upper boundary. In some of the 
following sections we propose to envisage a process of iteration from the left 
(or equally from the right of course). Such a process will be first applied to 
region of type (1) or of type (1) + [2). 

As seen from § 2 the upper boundary of (i) extends indefinitely upwards 
while the lower boundary of (1) will extend indefinitely downwards. The neg¬ 
ative axis of reals may be a B' curve or it may not. In the latter case a region 
of type (1) + [1] is suitable for iteration. Regions like (1) or (1) + (2) will be 

' and Q Jt {x) = — Q t j(x) are both considered as belonging to the same set. 
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said to be of type K. More generally, K will denote a region either having the 
negative axis of reals for a part of its boundary or containing this axis; such a 
region will be allowed to extend to the right bnt always excepting a neighbour¬ 
hood of the positive axis 1 bounded, say, by curves of the form v= + hu f (h, e>o). 
The following lemma will be proved. 

Lemma 4 . Suppose that in a region K (that is, a region of type K) ue have 

(1) U - *. 

the set Qj(x) (j = being associated with a difference system (i) or (i a); (§ 1). 

Let the matrix of formal solutions of this system be denoted by 

(ra) SM-(«*%(*)) 

(here the series s tJ (x) are sseiies (Def i ; § i)) and let T(x) be a matrix, 

(.b) rw-{«'*'%(*)). 

in which Uj(x)(i , j = I,... n) denotes 8ij(x) with the power series terminated after k 
terms (k being sufficiently great). Form the matrix 

( 2 ) Y'(x) - %(x)) - A[p - i)... A (x - r) T(x - r). 

The following holds true. For x in K the limits 

(2 a) lim #,(*) - y ix (x) (t = I u- • • "). 

exist, are independent of k, are analytic and are the elements of a solution of the 
system Y(x + i) = A(:r) Y(x). Moreover , in K, 

( 3 ) Vi 1 (*) - & w st 1 (*) *)•' 

Proof. The matrix Y r (x) can be expressed as follows 

( 4 ) Y'(x) = T(x) Y'(x) (Y'(x) = %(x)), 

Y'(x) = J] T ~'( x “ » + 0 Mx - 0 T(x - t). 

/■1 

1 That is, when working from the left. 

* Relations like (3) are to be construed as denoting asymptotic relationship with respect to 
the power series factors. 
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Now define a matrix b -V the re,ation 

B{x)=T(x + i)T-'(x). 


On the other hand, we have formally 

^W-(«vW)-«*+ OS-'W; 

here the a,j(x) are in K of the form specified in the beginning of § l. By a 
direct computation 1 we show that 

(5) A(x) - B(x) - ~ H[x) [H{x) = (M*)) 


where |M*)I ^ h > for x in K ' whiie *• “ \ “ W* * 0; *• “* x 
Now | i4(.r)| ?«oso that | fl(s)| s* o; thus, writing 


M 

we have 


,1 (r) =#(*)[/+*(x)| 

(/ ■* (d,j) — identity matrix), 


JV(a?) - ^ B~'{x)H{x) 


k —» co). 


The M-f'h if not bounded ( for |*| > j» >0), are infinite at x - » to finite order. 
Hence 


(5 b) 




i ^ - </ s ; f/j ^ o; —* ce as X - —* « j; here, for x in K, | C,j(x) | <, C. 

now the product 


Consider 


T~'(x + i)^(/) 2 '(x) = T-'(x)B-'(x)Mx)T(x) 

= r-'(x)(/ + ~ cw) r(x) - / + ^ r-'W cw m 

Writing 

7 ^‘(x) = 

r'HClilTWH^^rgW] 

1 Compare with the analogous procedure io til). 
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where 


The r,j(x), if not bounded, are infinite at x = oo (in K) to finite order. Con¬ 
sequently 

< 6 > r-(x + l )A(x)T(x) = 1 4 

(I M*)l S A in K\ 6 = ^ — d; dfio; b—< <*> as i-*oo). 

By (4) and (6) 

( 7 ) ?•(,) - ( 7 + fj-jn, - .„)... (, * .2 rf m, - . 1 ) 

= / + y H(x “ *l) + v *(* -h,)B(x-k.) 
f «*-*.)* „■*!»- Krix-ktf- + 


+ 2 


ff(z-*,)...g(s -t.) 


*!<•••«*. 




- + 


so that 


(7 a) *?.(*)-i,.+ 2*^^ + 

' | ■ 

i... 

- a„ + + 

I 


+ 2 c- 








2 A-- J -i U %a I (x - 4,)Ai,j,(x lt (* - *,) + 

*•. -V-r* 
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Take k sufHciently great so that b > 2; then the series (7 a) will converge 
uniformly to a function analytic in K (for »= provided that the func¬ 

tions (?*" ' J* (*) have a non-positive or limited positive real part. Now such 
a function is expressible as follows 

= &.(* - *,) + &*(* -« + •■• 

+ “ **-i) + Qi v_,(* - *•) 

«= Qu{x) + (<M*) “ + (felt* ~ *1) ~ - *1)) 

+ ••• +(&,_, i[x - *.-1) — i(x - A,)) 


* r~*i *-*«-! 

= Qu(x) + J <fn(x)dx + J &iW<*x + - + f 1 


(x)dx. 


If g Xl = 3 x, 9 ?x, < SRx, then with path of integration along the straight 
line goining x, and x we have 


t * 

9i J Q'(x)dx = j WQ'[x)dx 


So 


inasmuch as ffiQ'tx) = 9 i#,i(x) £ O. The latter inequality holds, in K, for 
s — 1.... n. Hence, in K, 


(7 b) 




Using the fact that | hij{x) | ^ h (in K ) we conclude that the limits 

Mm #fiW = $«(*) 

exist. In fact 
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Moreover, using formulas (12), (13) of (II; p. 248), we have, for x in K, 


(|m/(x)| ^ m; « £ a; a, any fixed number). 


#«M = •’/I + 


e^JH 


(| mi' (x) | ^ ; 3 x = r; u^a; | r | £ </) 

where c is a sufficiently great positive number. Using (4) we get, for x in K 


y,x{x) — lim y' x {x) = ' (lira »/J,(x)) 

1-1 

».<*>- + -'l” 1 -)' (« £ «>. 

i-l 

Thus we have for x in /T and for 1 — 


"*> , <lW -*«( (<lW+ »tt) 

(I *?»(*) I £ 17; »<£«). 

».(*)-«W-(«nW + ^) 

»<£4 

Noting that if the functions fy(x), 1;! (x) are not bounded they are infinite 
to finite order at infinity (in the two regions, respectively), we have the two 
formulas, valid in A', 

(*») *,(*)_ *<*w^ lW + ey) 

(|a/(x)|^o; uga), 

<» b ) ,¥.i(x) = t*" |f ( 1 (x) + (^'“-Jr) 

(|o/(x)| ^ o'; N ^ a). 
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In the above c=- —d\ <l l ^ 0 and c — » as » and /? is the maximum 

order to which the iy,'(x) may be infinite at infinity. Now, as specified 

previously, the region K is not allowed to contain a neighbourhood of the posi¬ 
tive axis of reals bounded by curves of the form 

v = + hu'(h, e > o). 


Consequently for x in K, in a right half plane, 


x 

v 






«i -f 


IT (I + 


Thus 


may become infinite along a path to infinity but not faster than \x\ l 


where i — e< I. It follows then that a relation of the form (8 a) will hold 
throughout K with the constant c possibly smaller near the right boundary of 
K. However, for the whole region K, the relation (8 a) will hold with c —» oc 
ns A — *ao. That is, by taking It sufficiently great c can be made arbitrarily 


great. 

The yt\(x) are the constituent elements of a solution of the system Y{x + i) “ 
« A is) r(r). Moreover, the limits are independent of le. This can be demonstrated 
by the reasoning of the kind employed for an analogous purpose in (II). This 
completes the proof of the lemma. 

The above lemma is concerning determinant limits of first order. We shall 
now consider determinant limits of higher order. Determinant limits of various 
orders have been previously used by Birkhoff in the paper (II). In this connec¬ 
tion the following facts, needed for the purposes at hand, will be stated. With 
the system (i a) (§ i) there is associated a difference system of order C'['(A — 

2, ... w), 

(9) n t*+ l)-.li(x)r*(*) 

where 


(9 »0 


A t (x) = (a, t ..)) 



I.• - • »; /,</,<••< /*; j x <h< • < jt). 


The multiple subscripts in the above and in what follows are to be construed 
in the sense made apparent by the relationship 


583 



Analytic Theory of Singular Difference Equations. 


25 


foj, hjt • • • Kit 

Ki, hy, ... hu k 

(10) K ‘t.J* h~ 

K k j, K k h • • - Kh 

while in the matrix (of order C^) 

(10 a) (Kif'ii-h) 

the set of subscripts (»,... it) refers to a row and the set of subscripts (j,.. ,j k ) 
refers to a column of the matrix. The difference system (9) possesses a formal 
matrix solution 

(11) = *...*(*)). 

The formal series *...*;*...*(*) are s series (Def. 1; § 1) and linearly independent. 
If (flffjfc)) a matrix solution of Y(x + l)= A(x)Y(x) then 

will constitute a matrix solution of (9). 

With the above in view we shall state the following lemma. 

Lemma 5 . Suppose that in a region K the coefficients of a system (1 a) (§ 1) 
are known. Assume, moreover, that in K, for all j t , .. .jt ^ n with j, < ••• < jt , 

(12) Hltf (x) + - 4 (ft(x)\ £ *(<&(*) + - + <SM 

The functions <?,(x), are to be considered as associated with the first, 

second, ... and k th columns of S(x), respectively. Form the determinants 

by means of the elements y^(x) of /he matrix ) r (x), defined by (2). 

The following is true. For x in K the determinant limits 

(12 a) Hmy;,.. ,*;i . .*W 

(/,,... it = I,... «) 

4-32M1. Ann mnlbrmalim. 60. Imprint \r 1 •rj.l.-mhrr 1932. 
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exist, are independent of the number of terms retained in the pcncer senes fact* 
of the t tJ {x) and they are the constituent elements of a solution of the system ) + I 

(fe). ( 9 ^)). Moreover, in K, 


(12 b) 




t [x)-e^*-^ lr \ 

I..--")- 


i k ; 1 



Proof. By Lemma 4 this lemma is true for k= !. When k - 2, as can 
be seen from (II; PP . 253-254). there is the following situation. With reference 
to the system (9), formed for k = 2, consider the product 

( . 3) Y;(x) = (£ „ (x)) =A t (x- 1 )... A t (x - r) T t (x - r) 

= <«*«<*» -1.... cj;«.. . i. **.•••») 


where 7 t (x) is 5,(x) with the power series factors terminated after a sufficiently 
great number of terms. Let the columns be so ordered that the function Q{x) 
of the first column of S t (x) is <?,(x) + &(*)• With (12) assumed in K for k - 2. 
by Lemma 4 it would follow that the limits of the elements in the first column 
of (13) exists in K. Moreover, these limits will satisfy all other properties spe- 
cified, in the lemma, for the determinant limits (of order two). Now by the 
reasoning precisely of the kind employed in (II; p. 254) it follows that the 
elements in the first column' of Y\[x)(r = 1 , 2 . ...) are correspondingly identical 
with the determinants 



H.M- *.,(*) 


The cases Is ■= 3. ... n can be treated in a similar way. 

We shall consider now a region R bounded on the left by a curve with a 
limiting direction at infinity, and extending indefinitely upwards (or downwards) 
while to the right such a region will be allowed to extend at most up to a 
curve of the form 

v = ±hu' ( h,e>0 ) 

(In general both boundaries of R will be B’ curves). Iterations for regions of 
this type will be specified by the following lemma. 

1 The column specified by the pair of subscripts UJn )" V 1 2 ~ 
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Lemma 6. Let V denote a strip of unit width situated immediately to the 
right of the left boundary of R (a region specified above). Suppose that we know 
in V a set of i) solutions of the system (l a) (§ l), 

yij(x) (*'— j-= I,...*) 

which satisfy in V the asymptotic relations 

yiM ~ e QjlI) Sij{x) 

and are analytic in V. Moreover, assume that in R 

(14) (j=2,...n) 

and that in the first column of S(x) (and of T(x )) ice have Q(x) = Q t (x). 

Letting T l (a:) denote r(x), with the first s columns correspondingly replaced by 
the ytj(x) (1 — 1,... »; j ■■ I,...i), and defining the matrix Y r, (x) l/y the product 

(< 5) l"'(x) “ (»;;(*)) -A[xA(x - r,)T'[x - r.) 

(x — r, in V; A (x) known in R) 
the following can be asserted. 

The y''(x) (*' — 1,... n; j — 1,.. .s) are s solutions in R, and constitute ana¬ 
lytic extensions to R of the s solutions originally assumed as known in V. The 
asymptotic form of the elements of the first one of these solutions will be 

(15a) /'(*) « yn(x) - e Q,,,) #/i(x) 

(1— I.... si; x in R). 

Proof. It is observed that 

>''Mx)=r(x)y''(x) 

where 

Y r '(x) = [ j] T-'(x - « + i)A(x - i) T(x - t) • 

• (r~'(a: - r x + 1 )A(x - r x )P(x - r,)). 

Here the expression for Y''(x) differs from that given before for Y (x) in the 
lust factor, and that just in the power series factors, by arbitrarily great powers of 
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x ~r. This is due to the fact that, as a consequence of the conditions of the 
lemma, 

T'(x) ~ S(x). 

Moreover, for every 6 xed x we have r* finite, so that there is no necessity for 
passing to the limit. Making direct use of the matrix equation (i a) (§ i) it is 
immediately obvious that the first s columns of Y r ‘(x) are analytic extensions 
of the solutions whose asymptotic forms have been assumed in V. Using (14) 
and applying the reasoning of the kind employed in proving Lemma 4 we derive 
the relations (15 a) thereby establishing the lemma. 

For determinant limits we have, by application of Lemma 6 , the following 

result. 

Lemma 7 . Let 11 and V have the meaning indicated in Lemma 6. Suppose 
that we know in V a set of s ((% Z s £ 1) solutions of the system (9), 

(16) 

(i„... <* - I,... »; » sets l/i • • >*)) 
which satisfy in V the asymptotic relations 

.„(*)' .*«♦-♦*«*•* „(X) 

and are analytic in V. Assume that in B 

(■6a) SIGW+ •" + «(** * HKW*)+ • + «*(*)> 

(ll< ••<>*— !»..») 

and that in the first column of S k (x) (11) (and of T k (x)) we have <?(*) “ Q x [x) + 

• + &(*)• 

Let T k (x) denote T k (x), with the first s columns (corresponding to the s sets 
(j[ ...fi) in (16)) replaced by the elements 0/(16), respectively. Define the matrix 
Y r k ‘(x) by the product 

(' 7 ) If!*) = A k (x - 1)... A k (x - rj T k (x - r,) 

(x — r t in V; A(x) known in /?). 
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The functions («„...** = the * sets of (16)) 

will be constituent elements of s solutions , in R, of the system (9) and will represent 
analytic extensions to R of solutions (16) originally assumed as known in V. The 
element ? of the first one of these solutions will have in R the asymptotic form 


17a) 


(i,< •••<«»= I,... n). 


Theorems entirely analogous to those of this section may be formulated 
when we work from the right instead of from the left. In this case a cut is 
made along the negative axis of reals to 6x the determination of S(x) and we 
consider the symbolic product 

Y[(x) = A-'(x)A-'(x + 1)... A~'(x + r - l)T(x + r) 

instead of l"(x). In this case we exclude the neighborhood of the negative 
axis of reals bounded by curves of the form 

v= t h(-uY ( h , e > o). 


§ 4. A Lemma on Summation. 

We shall now establish a modification of the method of contour summation 
used in (II). 

Let R denote a region the left boundary of which is either h (the lower 
boundary of T (§ 1)) or a curve, extending indefinitely upwards, with a limiting 
direction at infinity. Let the right boundary of R be a curve, extending inde¬ 
finitely upwards, with a limiting direction at infinity. This latter boundary, if 
with the limiting direction of the axis of reals will be assumed to be a curve 
of the form v = hu e [h,e> o). The left boundary of R, if extending upwards 
and with its limiting direction coincident with that of the negative axis, will 
be of the form v = h(— uY + •• (h > 0; 1 > e > o). 

The following lemma will be proved. 
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Lemma 8. Assume that the function 


(«> 


H(x) = e**h(z) 


is analytic in R while Q[x) = px log x + yx+ -+rx^ » proper on and in the 
neighborhood of the right boundary of R and 

(I a) Kx) ~ H(x) K) 

where H(x) is a formal sseries (Def I; § l). Furthermore, suppose that 

iRCWSo {in IQ- 


3 W 


(*> 


The equation 


y{x + i) — y(x) = e^h(x) 


possesses a solution y(x), analytic in a region R' interior to R by a distance e{> o), 
for which an asymptotic relation, 

( 2 a) y(*) ~ « QW, ( a? )- 

where s(x) is a formal s series, is maintained in the above region. 

Proof. The formal equation 

y(x + i)-y(x)-e««lf(x) 

is formally satisfied by y(x) = «« w #(x) where s(x) is an s series. This follows 
from a Lemma proved by BirkhofF in (I; p. 218). Let f(x) denote s{x) with the 
power series factors terminated after m terms (with m sufficiently great). Sub¬ 
stitute in (2) 

(*-?) 


(3) 


,(*) = ««“’(<(*) + $!) 


The new variable z(x) will satisfy the equation 

(3 a) q(x + l)*(x + l) — q(x)e(x) = —J-—- 
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Here 

= «*(•)»(*) - j e vut(x) 

and /?(x) is analytic and bounded 1 (| 0 (x)|^? 0 ) in R. To demonstrate the truth 
of the lemma we need first to show that (3 a) has a solution z{x) analytic in 
R' and, if not bounded in R\ infinite ati=» to a finite order ~k which is 
9 uch that k — k approaches infinity with k. 

The equation (3 a) with the second member replaced by zero is satisfied by 


Hence 


*«(x) = 


r(x) = 2*e-*’' 


- ^*(0 -?(*). 


1-1*1 1- 


will be a solution of (3 a) provided the operation ^ is suitably specified. 

1-* 

With x in R' let L denote a contour lying interior to R and defined as 
follows. When along the negative axis of reals while the lower 

boundary of R is h (the lower boundary of T} then L is to consist of h and of 
a path L* near the right boundary of R. In all other cases L is to consist of 
a path near the right boundary of R. With x not nearly congruent to L (that 
is, if x represents the point on L for which 3 *' — 3 * we have SR(x-x') not 
an integer) let x + k t (*, £ o) be the last one of the sequence of points x, x + », 
lying to the left of L. Let /, denote a loop which contains the points x, 

x + ,. x + h and passes between x — 1 and x and between x + k x and L. 

Now, by hypothesis, £(x) is proper along L (and also at least within a limited 
distance from L). Hence a least integer l can be found so that 


?i(x) = 2 nlv + MQ(x) -* + oc 




as | x | —* 00 upwards from the axis of reals along L. If n o, L is near the 

• pw could be considered to be a function asymptotic in fi to a formal sseriea whose 

_l 

power series factors begin with low powers of x v. 
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im aginar, axis of reals.• This integer l will be unchanged if L i 
finite distance in the direction of the axis of reals. 

Write, for x in R\ 


is shifted a 


( 5 ) 


q e Q(0 0(t) _ T 

J (i - e“' v -" 








Here 1 wiU be supposed to have the value specified above. The path L, is to 
consist of L, described upwards (if L = h + V. then k is described ^om -nfinity 
to the neighborhood of the origin and L‘ is descnbed upwards), and off,, 
scribed in the clockwise direction. When * approaches a position of congruency 
L we shift L through a suitable distance. The integral (5) will converge 
since *-.(*) remains bounded .long L; moreover, it will represent a sum formula 
in the sense of (3 c), and the function of > given by the second member of (5) 
will be analytic in If. It remains to show that this function is such that *(*). 
as defined by (3 b), has the desired properties. 

Let * be the point on L for which /,■ - I.. Denote the portion of L up 
to that point by L, and from that point up by L,. If L = h + L\ let L, 
denote the part of f up to x' and let V, denote the part of f up from x 
For t on L and for x in R' we have 


M 

The inequality 
( 6 ) 


|, > d > o. 

9Wx)*tt«(x,) 

( 3 x- 3 *r, SRxSfflx,; x.x, in R) 


will be also needed. It is seen to hold, in virtue of (l b), since we have 

9 l<2=- j 3 \</(x)dx. 


With these preliminaries in view consider the integral along l r , 

• This is due to the fact that along such a path and, in general, along a path extending 
upwards with its limiting direction coinciding with that of the positive axis of imaginancs we 
have R(t log x) behaving as a constant multiple of |x| (i. e. v\ 
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, 4 C e?U( 3 (x) e««' +, »/ 9 (x+i) e 9 ^fi{x + k t ) 

[7] J - (x+if lx-*-*,)*' 


In virtue of (6), for x in R\ 

(7a) |/| £, * ,w ^ + ir^ + " + F^) 

Now, substituting z = — x + I in an inequality of Birkhoff 1 , 


we find that 


2i pr+Tp < 2 1 x — i f* -1 

i m • 

so that, if R extends to the right of the imaginary axis, 

,7b) f^p + + i^TfaF- < 

(JRx = u > o; A, independent of 1 c). 

Let x(|x| > q > o; o) and x + ** be above curves 


(9ix>i) 


(8) v=h t (-uY' 

(8 a) v=h l u^ 

Substituting z=— x+l in the inequality (13) of (II). 


(/»,><>; !>c,>o), 
(^>0; 1 >e t >o). 




it is found that 
(8 b) 


ixp + "■ + ix+*,r"t'-» 


< -pri (w^o; 3 x=t;). 


1 See formula 12 in (II). 

f» —32511. AtUi moUtfmnlira. SO. Imprim* k 2 *eplembre 1932. 
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Let «(«<o) be the value of « for the point on the curve (8) whose ordinate 
is %X=V. Then, from (8 b), it follows that 

k\ ^ KM . 


Now for a suitable I, independent of x (u £ o), 

!«|2/|x|. 


whenever x is above the curves (8), (8 a). Whence 

. t ' - '»■(*'> 

|ip + W W'"-" 

(m^o; x above (8)). 


( 9 ) 


Thus the inequality 



K' 


WTitf < |x|*. 




holds, whenever x(|x|>p>o) lies above curves of form (8), (8 a). Here *, can 
be made arbitrarily great by taking k‘ sufficiently great and h‘ is independent 
of x. Hence for x in k\ above curves of type (8), (8 a), wc have 


(10) 




Whenever R contains the negative axis of reals the inequality (10) will continue 
to hold in the whole region R' provided that along the negative axis of reals, 
for | a: | sufficiently great, 91 Q{x) *o. In fact, from (7) it follows that 


(10 a) 


/I* •’“ 4 V 


eSiou* u-<M) 

Ix+Tf 


+ •" 


JtKKx+V-QWH 

+ (x+Wr 


on the other hand, we have 


(10 b) 


9 l*» 

= 9 lx< 9 ix; x, x in R) 
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diminishing very rapidly as x — x increases. So it is clear that the sum of 
terms in (ioa), involving factors of the form (lob), is negligible to the extent 
that (io) would hold throughout R'. 

The only case when the inequality (10) is not asserted is when R contains 
the negative axis of reals, while x is in R' below a curve of the form (8) and 
e* Q( ' > remains bounded as x moves to the left along a line parallel to the nega 
tive axis. In the sequel it will be seen that it is precisely in this case that it 
is not necessary to consider the integral 



(In particular, see Case II below). 

There are two cases to be considered. 


Case I. Along the negative axis of reals, fur \x \ sufficiently great, 9 i Q{x)y* o. 

In this case along any line in R, parallel to the axis of reals, 9 l 4 >(x) in¬ 
creases not slower than a positive fractional power of |x|, as x moves along 
such a line to the left. This is an immediate consequence of the nature of the 
function ^(x) and of the inequality (i b) (which insures (6)). 

Now, taking into consideration (5 a) and the integrand in (5), 



By (4) the integrand in the above increases exponentially along L, and attains 
its maximum at x, the upper end point of L,.. The integral will be of the order 
of magnitude of the value of this integrand at x. Thus 


(*, = k' — rf,; d s ^ o; k s -* » as k' -* » ; = %x'). 


In the case at hand, the function 

,Hir— 

approaches zero very rapidly as x moves to the left from x along a line through 
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parallel to the axle of reals. When * remains in a sufficiently close neigh 
borbood of X and does not part from x too rapidly as e(- 3x - 3*) mcieas , 
the fact can be used that 9 !(«M-«(*)) S o (9(x § Sxl; we shall then have 
„(*•) either bounded or infinite at infinity to an order t, such that ^ 00 

„ More precisely, this will be the case for x in any region bounded 

on the left by a curve of the form 

v = h(—uY {h,e> o) 

where < can be taken arbitrarily small. If there is occasion to consider a region 
below such a curve, for * in such a region (with e sufficiently small) we shall 
have approaching zero exponentially (i.e., as c - ' 1 ' 1 ' (r, y > °)) as 

| r |_, oo along any path to infinity in that region. It is clear then that 

l/KiS" 

• (* 4 -* « as lc' — «; x in R'). 

The integral along L, will be written in the form 

IJrlJ (r^V-.u-o-,)<*■' | 

11 L* 


For x in 11' 




As / moves along L, from x' upwards we have pi-i(f) bounded. Therefore the maxi¬ 
mum of the integrand, last written, occurs at x'. The reasoning of the type used 
in deriving (i l) will show that 

...» I/I<13? 


(k s -» °o as x in R') 


so that 


3' c* QM 

< 1 if 


[k" — °c as k' — »; x ii) R'). 
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Case II. Along the negative axis of reals SQ(i)=o. 

If the left boundary of R is not h this boundary will be of the form 

v== h{-uY + — (M> o). 

An inequality like (12) will continue to hold in R'. This can be shown by the 
reasoning used to derive (12). If the lower boundary of R is h the contour L 
will consist of 

L = h + L* = fc + L\ + L\. 


The contour L Zt in (5), will then be deformed into a loop, described in the 
counter clockwise direction and extending to infinity, containing the points x — 1, 
a: —2, ... and not containing the points x, x + I, — The formula (5) will 
yield the following 


:> 3 ) 


«•«-»*(*-1), ^(i- 2 ), 

(x-if + [x-if 


inasmuch as convergence may be asserted. 
Now 


(13 a) iRG(x)«ji|x| log \x\ cos a + (/«-/'v) 

+ l*?l|xpcos +^«) +••• 

(y-/ + V~i y"\ rjp>s.Z 1). 
Necessarily /< = y' = o and, whenever a coefficient 17 in Q(x) is not zero, 


3 *>) 


cos 




* 

Hence Q (x) = V~— 1 /' x + rj x* + • • * [p > s 1) while 

# 

(13 c) WQ(x) = -y'v± Mlxfsin j-(rr-a) + ••• 

(/> > * ^ 1); 
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hefe we may have |,|-o. Thia relation is derived by noting that, in virtue 

of (13 b )» 


(-, + f 0 ) = ±.in?(n-o). 


If SR Q[x)=-y"» ‘ten 

(ljd) «[«(*-«-CWI-o (*=', 2- 1 

If in (13 c) M^o, we define a curve F„ in I\ by a D equation 


„ = *(—«)"(*> o; 


For x in R below Fu we have 


. v _ v 

*-«-* — = — u + ' 


sin -(nr 

P 




ho that 


(14 a) 


s. a ^ hs[—u) u 

,in p («-o) 


Thus, below Fu, 


_ o) s + .-lal** + 




since |x| — — u + ••• 

Similarly, if x is in R below Fu, 

(14 c) 11?|I* - *| p sin | (nr - a,) £ + ... 

{x — i = \x — i= 1.2....). 

Noting that 

S % :'<?(x-«)- (M*)) — [± M |x “ *T* sin ^(ir - a,) + • J 

— MM P sin ■(» — «) + •■•J. 

we have by (14 b) and (14 c) 
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(. 5) 91 (<?(* -!)- <?M! £ * W— + ■■ <« 

[q independent of x, i; x in I? below 2'//; » = 1,2,...). 

Hence, whether 9 i<?(x) = - y"n or |ij| in (13 c) is not zero, the inequalities 
(15) are seen to hold at least for x in R below F u . Thus, from (13) it follows 
that 


IS 

Iml 


(x in R\ below Fu). 


Further, by formula (12) of (II), 


Id * I ■* 

l-x 

(x in li\ below Fu). 


If li extends above F„ the expression (13) does not appear useful for pur¬ 
poses of demonstration, whenever x is above Fu- In this case we use the re¬ 


lation 



The first of the last three integrals satisfies inequality (10). As to the second 
one, we have the integrand (as displayed in the second member of (5)) bounded 
along h (while x has a fixed value in /{' (on or above /'*//)). We have 

* ) l*r 

L, 

Here pi(f) is bounded along h and increasing exponentially along the remaining 
part of L lt i.e., along L*. Hence 



..ft*™, ft* 


l*T I xf 

ft lx 






Us — ao, as k' -* *; %x = = r). 
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With * restricted as stated, we hare ||| behaving in the most unfavorable 
way when * is on Fu\ we have then 

#<a + + - -*(-«* + 

(16) |a;i| s v *(-“)" A 

Thus, for a: in If on and to the right of F„, 

ft e* 91,1 


< T*T- 


-* » aa Jc -*<*>)■ 


A similar inequality is obtained for J. valid in the same region. In proving 
this inequality we again make use of (16). Hence 


(' 7 ) 




» as — « ; x in R' on and above /•’//). 


But in virtue of (15 a) an inequality like (17) is seen to hold throughout If. 

This completes the examination of Case II. 

The result just mentioned, together with (to) and (12). enables us to assert 
that an inequality like (.7) holds, for * in If. in any case. It follows therefore 
that »(*), “ B' ven by (3 b), satisfies in If an inequality 






Now ic _ J. = and approaches infinity as * approaches infinity (see (3 a)). In 
(3) attach subscript to „(*),«*),*(*)■ It is clear then that (a) holds for 'J.(x) 
to m(£) termes (m(*)-*» as *—«)• 

It remains to show that (,.(*)-*(*))*- Ql, ’(= 9 .*(*)’. in " If h 

is part of L. y.,(x)«o; otherwise, |9«<(x)l 3 (in If). Application of (4) 

and (6) completes the proof. 


§ 5. Construction of Proper Solutions to the Right of a Proper Curve. 

The following theorem will be proved. 
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Theorem I. Assume that the coefficients of an equation L»(y) =o (2; § 1) 
are known (Cf § 1) in a subregion of F (§ 1), 

G *(1) + (2) + ••• + (wi) + + it]). 

I jet the corresponding functions Q(x) be 

(1) 

Suppose that /•', a proper curve (Def 9; § 1) for the set (1), is the left boundary 
of (m) (2 ^ m ^ ij) or lies to the left of it. Assume that in a strip V, of unit width 
and with its left boundary coincident frith the left boundary of (»»), there exists a 
proper fundamental set of solutions (Def 4; § 1) satisfying the equation L„(y) = o. 
It will necessarily follow that L n (y) is completely proper (Def 6; § l) in (wi)+ +(q). 

If F, a proper curve for the set (1), exists in the region (1) then the above 
assumption concerning existence of solutions in V may be omitted and it will ne¬ 
cessarily follow that L n iy) is completely proper in (wi) + ••• + (q) (m = 1). 

Proof. As stated previously the regions (1), (2),... (rj) are separated by If 
curves, 

(2) B\ B\ ... B'-'. 

In any region (s) of this set of regions the Qj (x) (j = I,... n) maintain a certain 
ordering. We shall write 

(3) «W * » >Qt W fc "■ fc » .Q* W- (x in M). 

In connection with this ordering the subscript s will be attached, from the left, 
to some other symbols; thus, * 5 (x) will denote the formal matrix of a difference 
system corresponding to L n (y) = o, with «Q(x) entering in the j-th column. 
The set MM*), • • • .&(*)! is merely a permutation of the set (< 2 ,(x), 

&(*).••• Q* Wi¬ 
lt is sufficient to prove the theorem for the system Y(x + 1) = Z)(x) y(x), 
related to the given equation (2; § 1) and given by (6; § 1). This follows from 
the relationship (6 a; § 1) between solutions of the system and the single equa¬ 
tion. It is clear that the d,j(x) are known and of the same character as the 
U}(x) (; = 1,... w), the coefficients of L„(y). The process of construction of solu- 

6 — 32611. Acta mathrmatica. 60. Imprim6 l« 2 wptnnbr* 1932. 
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tions, about to be given, is of course equally applicable to any system Y(x + il- 
X(x) !'(*) (' a; § ')- 

By iteration (3) we construct, in (m), determinant limits of orders 1,2. ... " 
corresponding to the <*Q[x)s 

„?,(*), -Qi (4 + -<?.(*)'.&(*) + + 

respectively When m = 1 this process will be carried on by -iteration from the 
infinite left- (Lemmas 4 and 51 § 3 ) When » > , the process will be earned 
on by iteration from the strip V, specified in the theorem. In the latter case 
use will be made of the existence of solutions in V, as stated in the theorem; 
in this connection Lemmas 6 and 7 (§ 3) to be used. Generally speaking, 
application of Lemmas 4. 5 . 6 and 7 is possible in virtue of the inequalities (3) 
being valid in (m) (for « = «)• 

In agreement with the notation of § 3 let “>ese determinant l.mits be 
denoted, for k I, ... n, by 

( 4) 1 ..*(*) (f,< • <«*;«.. •••**- .*»)• 

These functions are analytic in (m) and satisfy the asymptotic relations 

(», < • < fV, 1*- 1 . ... m; * in (»|)). 

For k — 1 the functions (4) are elements of a solution of the system Y(x + 0 = 

D(x) r(x); write 


(4 b) 




... 


This solution is proper in (in). Assume that, for k— 1 £ 1, there exist / 1 

solutions, 

( 4 c ) mZij(x) (1— l- m; j= 1,.. • k— 1). 

which are analytic in (m), satisfy the relations 

(4 d) mZi t 1 ...»(*) “ ^ 

(i, <•<»#= I _ »; 9 — l. • • • k — 1; X in (mi)) 

and are such that, in (m) 
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(4 e ) ^{x) ~ e-^ trl -*W (»= i,.. • n;; = i,... k- i). 

Existence of a matrix of solutions, proper in (m), will be demonstrated by 
induction if we show tb^t there exists a solution mZtk[x) (t= I,... n) analytic 
in (m) and such that (4 d), (4 e) will hold for s = k,j = k. Analogous to a similar 
construction, in II, such a solution can be found in terras of certain determinant 
limits and the solutions (4 c). For this purpose use will be made of the following 
formulas, found in (II). (The notation used in this paper is different from that 

of (II)). 

We have 

(5) mhk[x) = 

V m - ^(fl ■»*»- .(<). 

15 a » - } *J W “ ^*-11 (<j (f + !)’ 


J 1-* 


Vkj(Q, 


(5 b) 


rntt l(< + 1 ) • • mtlj-l (<+ 0. + *)••• 

•*11 (f + *)••• 




(5c) ^(0=2 + - 

»-» l L *.-> 


mVl . #|_i ;!...»(<)- 


Moreover, for x in (m), 


M) 

while 
(5 e) 


. 

^.(x+i-i). 




-*„(*). -*«(*). •••■*«*(x) 

•‘n (x+i-l). 
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Tbe symbol ^ stands for summation and is to be suitably determined. It is 

seen that W ... -*"W arc known and a " a ' ytiC 

' Using (5 b) and the known asymptotic forms (4 e) the asymptotic form of 
>>e seen to be 


( 6 ) 


,mj. (*) - e - 9 ' 1 ' 1 *■'• ** — <*> 

Lui t _i (a:), an s series; x in (m)). 


On the other band, making use of (5 d) and taking account of the way several 
formal series with logarithms in the »■ Series factors arc related (See (I); in 
particular, ( 6 ") on p. 213). we conclude that 


(6 a) 


m ff.» ( x ) -e-"-' 1 * * -* w (a) 


(* in (mi)) 


where m y l,) (x) is an .^series without logarithms. 1 The series (*)(*«■ i» w ) 
cannot be identically zero since the formal series are assumed to be linearly 
independent. Consequently, by (5 a), (6) and (6 a). 


(6 b) 


«K»j(x)~#^ W «*iM 
O' — i,... k- 1; x in (m)). 

Here tho series m i tj (x) are all s series; moreover, by ((3); * “ >»), 

o U-1; * SnM). 

It is easily seen that Lemma 8 (§ 4) is applicable for evaluation of any of the 
expressions 


(6 c) 


§ 


r*.w 


0 ■“. . - 1) 


occurring in (5). In that lemma we only need to take /* = (>»), Q(x) — nQi-jU)' 
h(x)«* mVkj[x). Thus, by the methods of § 4 *e evaluate (6 c) as a function 
analytic in a region (m)\ slightly interior to (m), and satisfying in (mi)' an asymp¬ 
totic relation 


1 We can show this by a reasoning, applied to the second member of (5 d\ similar to that 
in (I; p. 215 ). 
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(6 d) 

i*/ 

where m u t ,(x) is an 5-series. Since the boundaries of (in) can be translated, it 
may be considered that (6d) holds in (m). Substituting (4 e) and (4 d) in (5) it 
is seen that «?it(x) is analytic in (m) and 

(7) n*\ k (x) - e" ,g * ( ' 1 mOx I (x) (x in (m)), 
where n 0ift(x) is a formal 5 -series. 

The remaining elements «x*t(x) (1 — 2,... n) of the solution may be de¬ 
termined as follows. We have 

* 

m*i k (x + q) = 2 (l ' 1,(x + q — l)^2, i,(x + q - 2) ... I n (x) * (x) 

V " 1 

(9 *=.. 

By the reasoning of (II; p. 259) these equations have a non-vanishing determinant 
so that 

(8) -A*lx) = d 4 ,(x)»*i*(x+ 1) + + d<,,(x)«fu(x+w) 

(»- 

Here the «b>(.r) (/, j ■■ 1. ... 11) are known in (in) and are of the same nature as 
the do (x). Thus, by (7) and (8), the elements »2a(x) (1 — !,... u) are analytic 
in (in) and satisfy, in (in), the asymptotic relations 

(9) (x) ~ (T Qt[I) m a,t (x) (i — 1,... n) 

where the series ^o,»(.r) are 5-series. Necessarily the relations (4 d) will be satis¬ 
fied for 5= 1, ...k. The function »*u(x) is such that (5 e) holds; thus, using 
tha asymptotic relationships (5 d), (4 c) and (9), we conclude that the m fl/t(x) 
(/ — 1, ... 11) in (9) can be replaced by the .v <t (x) (1 = 1,.,. 11), respectively. Thus 

a solution „f, t (x) (/ = 1-n), possessing all the desired properties, has been 

constructed. This proves existence of a matrix solution proper in (in). The above 
indicates also the actual process of construction in any given case which satisfies 
ih.* specified hypotheses. It is essential to note that in applying Lemma 8 we 
have, according to the hypotheses of the Theorem and as required by the Lemma, 
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,he function <?(*) = -< 3 * jM P™P« r al ° n S the P ortion of the P a,h of inte g ration 
r the right boundary of (m). If in the various summations involved in (5) 

additive periodic functions are admitted the *th solution, •*»(*) (•« L • • • *)• 
!vill be modified by addition of linear expressions (with periodic coefficients) in 
the elements of the preceding k - 1 solutions. Unless stated otherwise such 
periodic functions will not be introduced, the summations in (5) being specified 
by § 4 The n solutions constituting the n columns of the matrix 

(l0 ) mZ(x) = UftjM) (*.>=«,••• ”) 

may be spoken of as 'associated with determinant limits’. 

The regions (1), (2), ... (1?) may be considered as having strips Vjj+i (between 
(j) and 0 + 1); j = I, ... 17 — 1), of unit width, in common. In the case at hand, 
there exists a proper matrix solution in V m . *+1 (if m + l ^ 17). By the process 
indicated for the construction of m Z(x) we now obtain a matrix solution, 

( |0a ) m+lZ[x) = (m + l*lj[x)), 

proper in (m + 1), the constituent solutions (columns) being associated with the 
determinant limits, known in terms of the m*ij[x) in (wi +1). By a finite 
number of steps proper matrix solutions, T Z (x) — {r*tj (x)) (r — m, ... w), are 
constructed in (m), (m + l), ... (17); these solutions will be associated with deter¬ 
minant limits. 

It remains to demonstrate that the periodic functions connecting these 
solutions arc proper (Def. 5 ; § ')• Let Z»(*)-(*?,(*)) denote ,Z(x) with the 
columns so rearranged that 

(,,) U in (r); rS 4 

Write 

(I I a) Z'(x) = Z' +, (*)P , r+l W. P r f+, W = (fj ,+, W). 

Let = 3 x, x—./> = integer and restrict r, to lie in the strip V r ,,+ > (when 
We have then for the matrix P rr+, (x) of periodic functions the 

relation 

(, , b) P'-'+»(*) - P r r+, (Xr) = Z'"-'(Xr)Z'(Xr). 

For x(=x,) in V r . r+1 the following asymptotic relation will hold in virtue of In) 
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P r ’ r * l (x) ~ S-'W S(z) = (eV'‘ do) 

m=n 


47 


In other words, for 3 * ^ f > o. 


( 12 a) 



(/,j=l, ...»); 


in the above k can be made arbitrarily great and the |6i/(x,)|, for a fixed k, 
are bounded. The strips V rrtl (r=»n,... »;-i) extend indefinitely upwards (i.e., 
when |*| approaches infinity in K,.y + ,. r= 3 x-* + »). These strips are to the 
right of a proper curve F and they are in a proper region Hr (Def. 9; § 1). 
The term proper’ refers, in this connection, to the set 


4M*).... 4>„(x). 


By (11b) the p'j r * l [x) are analytic for r £ g > o. By (12 a) and in virtue of 
the fact that the Qi)[x f ) are proper in F r .r*i (Def. 3; § 1) it follows that 


(« 3 > 



(«,i-“-17-1; H'/+ x , an integer); 


here the power series converge within a sufficiently small circle with z =0 for 
center and, unless p r l) , * l (x) ■* O, it may be supposed that p'j'£'*0. Now 
|*| =. |*j*i ^T/| — e -4 "*; thus, it is clear that 


13 *} 


, —; n'.f+l 

pb’+'w-pi::'''" 11 


(».i-i...• »*; r=m > • - - *7—>1 


in every region of the kind indicated in (Def. 5; § 1). Hence, in accordance 
with this definition, these periodic functions are proper. In view of the rela¬ 
tionship between solutions of the single equation L n (y)=o, and those of the 
system, L„[y) is seen to be completely proper in (m)+ • •+(»?). 
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§ 6. A Lemma on Factorization. 

The following lemma will be indispensible as a preliminary to establishing 
the fundamental result. 

Lemma 9 . Let coefficients of 

(|) Ln[y )■* y (*+») ■+ (*) y (*+ M -1) 4 • ■■ • + y W = o 

lc known (and be of the kind specified in the beginning of % i) in (i)+ •+('”). 
u s „bregion of I'. If the equation is Qfactorable in (l)+ • +N [Def 8; § i), 
„ point of division being between the I'-th and I'Y\ st tcnns (not belonging to the 
same logarithmic group 1 of the sequence 

Qt[x ),... Qn(x) ii*r<n). 


it necessarily follows that the equation is factorable, 

(, a ). L n (y) “ Ln-r Lr(y) = o. 

sq that the coefficients in the operators L— r {z), Lr(y) are of the same kind as in (i). 

With the e Q J {,) Sj(x) (7—■ I._w) denoting a linearly independent set of formal 

solutions 0/(1). the factorization (l a) can be so effected that the series 

(lb) eWs l (x),...eW Sr {x) 

are formal solutions of Lr(y) = o. 

Proof. In connection with the system Y(x+ \)=D(x) Y(x) (6; § 1), asso¬ 
ciated with (1), functions y r tJ (x) are defined by the product 

( 2 ) Y'{x )=(*;,(*)) 

- D(x- 1)... D{x-r) T(x-r) 

where T(x) denotes «(*)[=* (e^ w ^))-(^ (x+# " ,, ^(- r +«'-'))I with the ** eries 
factors in the involved elements terminated after, say. k terms (Xr being suffici¬ 
ently great). In accordance with the notation of § 3 we write 

' Of. 6" (p. 213; I*. 
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(2 a) 
where 


!?,.(*). ••• *?,/•(*) 
y< r i(*), - y# r r(*)| 

•1 < », < ••• < »> and *„ n. 




Since, by hypothesis, 

W,(*) + + <?'r(r)] £ »!<&.(*)+ ■■■ + 

(it<•••<>/•“ * >n ( 0 + ••• +(»)) 

in virtue of Lemma 5 (§ 3) the limits 

(2b) Umjlf, , r;1 ,(*)-»,. , r: , .» 

(«!<- <«>- 

will exist in (1)+ +(m) and will be analytic in this region; moreover, the 

asymptotic relations 

(2 c) . t r : 1... r(x) ^ • • rlx) 

(ij < *••<!>— 


will hold in (1)+ ••• +(m). Form the operator 


( 3 ) 


here 
(3 “) 


L'-M = ( - >) r 


yM 

y(*+ 0. 


y(i+r). 


yl.W • • • y, r (*) 
y5,(*) •r.rW 

rr+i,iW • • ■ S'r+i.rW 


= 6':(x)y(x+r)+ +VJu.Wy(*+») + +»'' r (*)y(*); 


‘UriH-iry;. 




/*♦! 




From the way asymptotic relations (2 c) were derived in § 3 it follows that 

(3 i>) ?! .., +1 ... r + « Q,W+ ‘ ‘ ’ +QrW *1 • ■ • «.•♦* • • ««; 1.. r(s) 

(5 = 0_ r, x in (1)+ ••• +(m); r=i, 2, ...). 

Hence an equation Z/'(y)=o will possess in (l) + — +(#*), formal solutions 
7 — 32fil I. Aria malhrmaliea. SO. Imprim* Xc t aeptembrc 1932. 
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««*«*> ,,(*), f^ ul *,(*)_ eUr ' t1 *rW- 

In virtue of (2 c) and of the fact just stated, if L'Ajt) denotes lim L',.[&)* t * ,, ‘ 
C(j nation 

(4) Vr[y) = l'My[x+ n+ ■•+^*)fW - o 

(6V-W = 1-1. ... t« . #*♦ 1. . M) 

will possess the same formal solutions. Here 

(4 a) i;i M 

(.?=o, t,... /*; r in (1)+ • +(»*)). 

In particular, the .v-series in the second member of (4 a) cannot be identically 
zero for .s=o and s=T\ this is a consequence of linear independence of the 
formal series. Hence 

6 ' 0 (x) * O. b' r (x) * o. 

Thus the equation (4) is actually of order /*. Dividing out the coefficient //„( ' ! 
wo write (4) in the form 

(5) M.v) ■ if{x + r) + • • + b,<— (r) .*(.#; + *) + • + b r [r) //(.#) ■» o 

(f»/w,(x) — ///_,U)/l/ 0 (x); je«“0-/— 1). 

The coefficients in (5) are analytic in (1)+ + (**); moreover. 


(5 ») 




».»n / * lj 1 jfa) 

*1 /.i M 


W 4 


(«=o_r— t; x in (l) + • • • (ml). 


Now, the formal series s x .r+i:i ..#ir) («=o, 1_ V) will contain no 

logarithms since the columns in the formal determinants 


*.•**. r*i: 1 ■ #*(•»■) 

can be so combined as to get rid of these logarithms (Of. I; in particular, 
pp. 213, 215). It is clear, moreover, that there will be only rational powers of 
.*• present in the formal series ,?,_(.#-) since the constants r occurring in (7 a; $ 1) 
differ by rational fractions in the consecutive formal series (7 a; i? 1) in any 
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group (I; p. 213) of such series (containing logarithms). Thus the operator 
L r {y) has all the properties required by the lemma. The factorization (1 a) follows 
immediately. The coefficients in the operator L*-r(r), 

( 6 ) L„-As) = s[x + n—r) + c^x)s(x+n—r- l)+ • +c*-Ap)z[x), 

will be analytic in (1) + + (m) and will be of the required character in 

(i)+-"+(m). The lemma is therefore proved. 

The equation L n -r{*) = o will be formally satisfied, in (1)+ •■■+(»»), by 
the series 

(6 a) e 9 r+, .W a r .^(x) (= LrW*r+i* w 5 r+ M (x)) 1 

(fi« I,... n- r). 

On taking account of the established nature of the bn-,(x) (.*=0, !,... T—i), 
it is seen that the series 0/».(x) are 5 -series. 

§ 7. On Products of Completely Proper Operators. 

The following theorem will be now proved. 

Theorem II. Suppose that the set 
(1) Q.M, 

belonging to on equation 

(1 a) L n (y) = O, 

has a point of division in (1)+ ••• +(m). Here, as before, (l) 4 - ■■•+(»») is a sub- 
region of /' the constituent regions (1), ... (m) being separated by B' curves. Assume 
that corresponding to this point of division tee have 

9t Q\(x) > 9i (fr+M 

(A=i, ... /*; « = i, . ..n— /*; x in (1)+ ••• +(m)), 

where an equality sign is admitted on the boundary of (l)+ + (w). With the 

coefficients in (l a) of the right kind (Cf § 1) in (l)+ • +(m), let 

1 Whenever a formal series is formed by writing LrifQM #(*)) («(x) an *• series) the coeffi¬ 
cients in Lr, if not representable by convergent series, are replaced by the formal series to which 
these coefficients are asymptotic. 
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(l £ / < ••) 


52 

(I b) 


he the corresponding factorization, as specified in Lemma 9 (§ 6 ). Suppose, more¬ 
over, that in (m) (or farther to the left) there is a carve F which is proper with 
respect to the set (1). 

It irill necessarily follow that, if the operators £„_/•(*), L,\y) arc completely 
proper (I)ef 6 ; § 1) in (1)+ • +(m), the product L n (y) will be completely proper in 

(0+ +(”»)• 

Proof. As an immediate consequence of the hypotheses of the theorem 
the following: is true. 

The equation 


(2) Lriy) ■ y(x+ l') + b l (x)y[x + /'-!)+•- + b r {x)y(x) - o 


possesses, in (1)+ ••+(«*), I linearly independent formal series solutions 


(2 a) e’^sjix) 

The related system of order /' 


O-l.... / ). 


(2l») 


}V0 + .)=7) / U) 17M. 



possesses, in (1)+ f(»n), a formal matrix solution 


(2c) 5 /U) - {W's^x)) - + 1)) 

[ij -1,.../) 

(the s t j(x), .^series). This system is satisfied by a matrix solution 

( 2 ‘» r*w = (jfj(*)) = (,7(x+i-i)) ( 

consisting of elements analytic in (a) and of the asymptotic form 
( 2 e ) *'•(*) ~ S/i*) (* (o)\ 0=1,... »i). 
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Of course, the elements in the first row of Y°[x) form a fundamental set of 
solutions of (2), and conversely. Bv hypothesis, .*ich matrices >"(*) (**i,... wi) 
exist so that the matrices of periodic functions, defined by the relations 

(3) Y’(z) = Y'"{x) JP(x) (/<'(x)-(rj(x))). 

are asymptotically representable as follows 

(3a) /?'(*)>7) 

(the rjj , integers; .«= i,... m— t; 3 x ^ 9 > o) 

for r in any region as in (Def. 5; § 1). Here a constant r* is not zero unless 
the corresponding function i-J(z) is. 

Letting have the same meaning as in § 5 and reasoning as at the 

end of that section we conclude that, for & g > o, 



s—l,... mi—1; 3x=3x,; 9i(x—x,), integer; x, in V t , t +\). 

Here k can be made arbitrarily great and the cr^(x) are bounded in V,,,+\. x 
Ii\ the right boundary of (#), while a B' curve for the set (1), may be not a li 
curve for the set Qj[x) (j« 1, ... 0- In such a case (rj(x)) can be taken as 

/( - (<M). 

Analogous facts can be stated concerning the equation 
(4) U-M * *(x +11- 0 h c,(x) z(x + n+ T-1)+ • ■■ • + c»-r(x)^(x) - o. 

In (1)+ •••+(»») it possesses n—F formal solutions 

(4 a ) 0r+iS* < 7 />„(x) = L r [e Q r+n {x) s r + H {x)) 

(/«=!,... n-r. the a r+ ^(x), formal 5-series), 

the c^ (/, 5j(x) (j= i,...m) constituting a linearly independent set of formal solu¬ 
tions of (1 a). The related system of order n—T 

' Relations 3 b) would continue to hold if the operator L r y) were merely proper (Def. 4 ; § *'■ 
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+ l) =- T)n-r(x) Zn-Ax), 


D n -Ax) 


O, O, I . . 


—Cn-Ax), ... -f,(4 


will possess, in (l)+-• +(i»i), a formal matrix solution 

(4 c) S h -Ax) = (e y '-*> ul Or+i. r*j[x)) = artj ( r + /- 1)) 

(i,j= I, — M— r) 

(the Or+i, r +j(x), ^series). There exists a matrix solution of (4 b) 

(4 d) - K+M + »-«)) 

with elements analytic in (0) and of the asymptotic form 

(4 e) Z*(x)~ S m -Ax) (* in (0); 1.... w). 

Matrix solutions, like these, exist with the additional property that the matrices 
I M [x), of periodic functions defined by the relations 


/'(*)= Z—(x)P'(*) 


are of the asymptotic form 


(the ]>' r + / r +j. integers; s= i,... m— 1 ; 3* ^ p > o) 

for .r in any region as in (Def. 5; § 1). Unless a function y;',(.r) is identically 
zero the corresponding constant pfi is not zero. Moreover, for £ Q > o, 

Is b > (Pr*i.rtjW) = ;(*.». 


K, tr .iW) “ (<« 


(v— 1,... iii— 1; 9 i(z—x,). integer; x, in 

In the above X can be made arbitrarily great and the ,i: U) are bounded in V f l ,, 
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A process of group summation will be now applied for the purpose of solv¬ 
ing the equation 

(6) Lr(y) = ^(x) (i S p £ n-l'; i ZsZ in). 

Let Z denote the matrix of order T 


(6 a) 



In each of the several rows the elements of the matrix 


(6 b) 







are the same. An element of the first row will be a solution of (6); denote it 
by yj. +/ 4 fcr). This function will be also a solution of (i a). Writing Y’~ 1 (x) — 
= (0JM) (*. I,... r) we have 


( 7 ) 





Here the summations are to be suitably determined. Consider a summand 

«r(* +, )4fnM- By(3) 

(7 a) r*W-r« (x)R(x) [s<ef*m) 

where 

K(x) = (r„(x» = R'-'ix) R'-’(x)... i?'(x) 

and 

(7 b) -«***“» r «*r8 

(/. j= i. -.. r; the r, i% integers; 3 * ^ e > o). 

Thus, noting that 
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(? v {x)) = R-'toY'- l (x) 

{R-'{x) = Mx)) ~ («■»’ =iVf#; I'^W = 0$)). 


we have 

( 7 d) ft r (* + 0 (* / ’ , • 

On the other hand, by (5), 


•»=! 


(B) 

where 

and 

(Ba) 


Z'(x) = Z‘(x) P(x) (s < e £ »*) 

P(x) = (pr+,./>,(*)) - P'-W P'-’W ... P'(x) 

Pr+f./>j(x) - e**'- lp r+t.r+j*p* rHi r+j 
(t,i— I. • • • w-P; the Pr*i.r+t, integers) 


in any region extending indefinitely upwards, as in (Def. 5; § 1). Hence 

»—r 

(8 b) P r+ „(0 - 2 '♦I'M- 


With the periodic functions in (7 a) and (8) formed for e»»i and using 
relations (2 e), (4 e) we conclude that, for x in (m), 


(9) Kr^+O^W" 

<*" 1 W-I 

~ 2 2 e Qr * w ' alx) *' 2 ’'' /•♦/-)* FicpU*. r+ h 0r+»(x)8«/'(x) 

«=l tr=l 

... 

In the above the .v„/>(x) are s-series defined by the relation 

s?W = W*9&t)r' = (c-wt.M 

(i,j= «,...r). 

This is a consequence of the fact, pointed out before that |(fy(x))| (*, 7=!,.../') 
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has no logarithms present. In a strip V m , »+i of, say, unit width and situated 
in (m) near the right boundary of (m) the real part of some exponent, 

(9 a) Qr+* . ,(x) f 2 n V — l (fit + p r *v. r+p)x, 

corresponding to non zero constants f«, pr*».r+ H is equal or greater than the 
real parts of all other exponents (corresponding to non zero constants r*, //). 
Let a=a, tc=tc be subscripts for which this occurs. Then, for x in 

(io) ft r (* + i)4+ m (x)~ 

o r+ -‘(x).V/-(x) + p; / ,(x)). 

Now, by hypothesis, to the left of there is a curve /*' which is proper 

with respect to the set (i). Consequently along every path, extending to infinity 
and lying in V mm + , the function |e Q /w.* w | has a definite order with respect 
to in fact, the exponential factor in the second member 0/(10) tv ill 

have the same property. The expression #„(x) stands for a sum of a finite 
number of products of the form 

(10 a) />(x)s(x) 

where |/>(x)| = 1 and s(x) is a formal ^series. Such terms may be present only 
if there are more than one exponent (9 a) with the same real part (which is 
greater than the real parts of all other exponents). 

Suppose again that s < e £ m. Consider (7 a), but in place of (7 b) use 
(3 b). We have 

(") MWT - PJM) = 



where the aj(x) are bounded in F f> and k x can be made arbitrarily great. 
Thus 

(11 a) (^(x+.)) = (P 0 W)(i/ 0 (x+!)) 

= frjW)frJ +l W) • • • 0) 

8 — 32511. Ada maihtmalira. SO. Iinprim* If 3 •eptfmbre IS32. 
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' V -^ 1 V 

• &w ffiM • • • &-.•*<* + 


Now, by (2 e), for x in (e) 

(yj(x)) - = (e^%(x)) (*.>= i. • • • r) 


where the fy(x) are s-series. If we write 

(i i b) ($J(*+1)) - (rW*$(*)) (x in (e)), 

it is apparent that the $\j(x) are s-series. The relations (na), (lib) give the 
asymptotic form of the yj(x+ i), for x in (e), in such a way that the exponential 
factors are explicitly given in terms of the 4 Mx) lT). 

Similarly, by (8) and (5 b), 

(12) ( 4 + i. />,(*)) = Utm. r»iW) (Pr+i. rejW) 

“ (*/>#. r+>( z B (Pr+l. r*jM) • • • ^r+i. 

h-r 

2 j)> • • ’+Qr*j. rf •,_,(*#) • 

(1, i — i,... 11—r). 



For a; in (<*) the 4 +/.r+/(*) sat * s ty the asymptotic relations (4 e) (with a=c). Thus 
we have an expression for the asymptotic form, in (e), of the /4 ,(x) in which 
the exponential factors are in terms of the (>r+j(x) (/— I....M — T). 

Consequently, for x in (e) (s < e), the following asymptotic relationship 
will hold, with 1 is X SS 

(13) 

’ , “ r **-«:*.. 

2 e Or* 0 ,(x). 

,*l *1 • ■ • *f—■ *■I 
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Here 


(« 3 a) 

+ i c _ # _|(Xr-i)) +1 Qr+c. /-+«,(*<-■) + r+*(*e- j) + • • • + Qr*?, r* «v_,(x»)) 

where 

(i 3 b) QJM = Qjt{x) + log ej(x) 

#£*) 

and 

( 1 3 c) Qr+). rn(xg) = Q r *j, r+ifa) + log «J. +( r ^(x,) 

(«,;= 1.. ..n-r). 


The expressions involved in (13 b), (13 c) are logarithms of corresponding periodic 
functions. We take suitable determinations of the logarithms. Whenever a periodic 
function is identically zero the corresponding term in (13), or in any similar 
sum, will be zero. The summation signs in (13) will be considered as extended 
only over those terms for which the periodic functions are not zero. Only 
superscripts and subscripts corresponding to terms actually present will be con¬ 
sidered. 

In the sum (13), for any given x in (e), there is a set of subscripts and 
superscripts 


(13d) 

such that 


Wi-** >■<-•) = (r.... avj, (o,... o_.) - (o',... 0<-.) 


13 e) 




((A,... )-c—i, o,... Of—i) 9* (A,A f-«i 01... 0 ^-f)i A,... Af_j — 1 ... / j 

o,... Of—* = I...W—r). 

Accordingly, for x in (e) {$ < e), 

( 1 4) $!r(*+ 0 «!hpW~ 

r(x)o r ,.■,(*) + ) 

(iSImST; i S o’, S n—r) 


where... stands for a sum of finite number of products of the form (10a), 
which may be present only if there are more than one set of subscripts-super- 
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scripts (13 d) for which (13 e) holds. In the 9equel the inequalities satisfied by 
l' ( _ t and a\ will be found essential. 

In ($) the asymptotic form of i/ lr {x + i)^. t/1 (x) will be (Of. (2e), (40)) 

(15) PiM + 04 +^W ~ s 1 i\x)Or 

1; I^r). 


Since by hypothesis, for x in (1) + ••• + (m), 

9i $',(*) >9i«7*„(x) 

(except possibly along the boundary) so that 

N Q'r+„.i[x) < o. 


it follows that 
(I5a) 


91 Qr+p.i{x) > 9 ) 

(3* “3*; 91x<9ix') 


whenever x and x arc in (1) + ••• + (m). 

Define the function Lr+,,.k(x) as follows 


i£/iSn-I 





(x in 

to). 

(16) 

£/•♦*■. a(x) 

K: w'w 

(x in 

to; *<<*S»»») 

By (15 a) 





(16 a) 

'KLr.^.a(x) > 9 iZ.r*#i.*(xO 




( 3 x = 3 x; 

91 x < 91 x'; x, x' in 

(*))• 



It will be proved now that, more generally, 

( 17 ) 91 £/•♦#.. a (x) > 9iLr+*.i(x') + f 

(3*-3*; 91* <91x0 

provided that x is in (s) while x is in (*) + •*• + (m), say, in (e) 
while $ is a real magnitude negligible in a sense to be specified below, which 
for x in (s) can be taken as zero. Assume for a moment that (17) holds. In 
virtue of (16), (13 a), (13 b) and (13 c) the inequality (17) can be written in 
the form 
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(17a) 9i Qr+t.M > 'K { Qr »✓,. + Kfn (x.) + Qk\i\ (*.+1) + • • 

+ Qk' e -.- l * e - t - J [x<-i) + + \Qr**-,.r+s,{x<-\) 

+ Qr+o’,. + ••• + Qr* o' f _, r+(x,* 1) 

+ Gr+j..r+*V_,(x,)|) + §, -t § 

where 

(«7 b) S, = log |e 5 ri Wfjy. t (**,)... 

+ ,0 Rl rr 7 i^ 1(r * r ' 1 (x,-i)... 

Since (16 a) holds it is necessary to consider only the case 1 < e. The points 
x,... Xt-i, while correspondingly in the strips F*.,+i,... K«-idepend on 
x (x—x„ ...x Xf—i are integers). By (11) and (5 b) the real function may 
approach - « as |x'|-*oo; but, in any case, 

(.70 

where |6(x')| is bounded and k can be made arbitrarily great. Let {«—{,. 
If (17 a) is demonstrated, with the inequality (17) will have been de¬ 

monstrated (with §=—§,). Regrouping terms in (17 a), with the inequality sign 
displayed tentatively, 

(17 d) N Qr+n. i(x) > 9 * { Qm u *v—+ KW*r f __,(x,-i) - Q r +*„ r^lxr-i)) 

. + “ Qr + #' ( .A- f _ # _ I (x f - 4 )l + ••• 

+ ( Qr* o' ( a*, lx, ♦ 1) — Qr+/,_ # _ I .r l (x, + i)] + (<?r+#..2(x,) — ^r+#' < _ l ,r l (x,))} 
or 

(17 e) 9i ( Qr*?. i(x) — Qr*?. i(x,)| > 91 {[ Qr+*„ r^x 1 ) — Qf+✓,. r e _,(xe-i)) 

+ I’f-.-^Xf-i) - Qr+o’+ + [Cr+«' fc i' e _ # _ 3 (xe-i) 

“ Qr+o ’,.+ (Cr+*Wi , l (x# + i) “ Cf+«V-,.*' l (x#))|. 

Now 

(17 f) 3 x= 3 x,= ox, tI =---= 3 xf_i= 3 x', 

9 ix < Six, < 9 ix,+i < ••• < 9 Jx«_i < 9 ix\ 
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The functions @(£), w > fc b double subscripts, occurring in (17 e) have increasing 
real parts as £ moves to the left parallellj to the axis of reals and within the 
region (1) + + (m). This follows from the fact that the first subscript, in such 

a function, is greater than T and the second subscript is equal or less than l 
(see 15 a). Taking account of (17 f) we conclude that the left member in (« 7 e) 
is positive while the real parts of the differences occurring in the square brackets 
in the second member of (17 e) are negative. Hence (17 e) holds. The steps bj 
means of which (17 e) was derived are reversible. Thus (17) is demonstrated with 
£=-£, ($ = o in (s); (17b), (17c)). 

In view of the preceding we are led to consider a summand 

(18) ?iAx + = «*«*(*) 

where L(x) (= L r ^,i(x)) satisfies (16) and (17) while 

(18 a) h(x )- H[x) (x in (*) + ••• + (m)). 

Here the formal expression H(x) is a formal s series (// 0 (x) = 5 'ar(x)o r+/ *(x); (15)) 
for x in (s); in (e) (s < e 5 m) H[x) is a sum of a finite number of formal expres¬ 
sions of the form (10a) (J*(x) — $V # _*.r(x)ffr+#',(x) + •••; see (14)). Furthermore, 
H(x) is analytic in (*) + ••■ + (m). 

In this connection, as well as throughout, when we say that a function is 
asymptotic to a formal expression it is meant that the function is representable 
by this expression with the power series factors terminated after a sufficiently 
great number of terms, while in place of the discarded terms expressions are 

introduced of the form (|6(x)|, for k fixed, bounded; * arbitrarily great). 

In order to obtain an evaluation of 

*—* 

that is, a solution of 

(<9> *(*+!)-*(*)-«*<•»*(*) 

consider first the formal equation 

(19a) y(x + 1) - y(x) = JMH'ix) (= 
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By a Lemma of Birkhoff, previously quoted, such an equation certainly will 
possess a formal solution 


(19b) 


*(*) = *««#(*). 


where .»(x) is an 5-series. Let <(x) denote with the power series factors ter¬ 
minated after m terms, m being sufficiently great. Substitute 

*(*)\ 


(20) 


+2?) 


(A- = m'/p) 


in (19). The new variable z[x) will satisfy the equation 

(20a) q[x + i)z(x + 1) - q(x)e[x) = F[x) [q(x) = e <?|,, x“*) I 

F(x) = e L ^k(x) - Je™t(x) = l\(x) + F t {x), 

F x (x) = e L ^h(x) - e<>M «',(*), F t (x) = ifjp) - Je«'’U(x) 

where It 0 {x) is H 0 {x) with the power series factors terminated after a suitable 
number of terms. The function lt 0 (x) is analytic in (*) + ••• + (m) and 

(20 b) It 0 [x) ~ H 0 (x) (= S'ar(*)ar*,(*)) 

[x in (#) + ••• + (m)); 


hence 
120 c) 




where f t (x) is analytic and bounded in (*) + ••• +(m), and k" can be made 
arbitrarily great. On the other hand, for x in (#), 


[20 d) 


F,(*) = ^^(lA(*)|S/»,V 


where A' is arbitrarily great. 
Writing 

(2i) §*W 

t-x 


!»z f=x 


(0 


we 


evaluate ^ /'’,(/), precisely as in § 4 by means of a contour integral with 


1-2 


1 The same remark can be made to hold for /?,(x), /?,(x) as previously made regarding /?(r) 
of (3a; § 4\ 
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path of integration in (m) near the right boundary of (m). Thus ^ F a (t) may 

i »i 

be considered to be a known function, analytic in (*)+•••+ (m) such that 
(21 a) 

>“ M + — + M; — 00 as *-*«)•' 




l-X 


Let x be in (s). Write, in accordance with § 4 (Cf. (5)), 


(22) 





where L t is a contour, formed as in § 4, with the constituent part L situated 
in (m) near the right boundary of (m). The integer X is determined depending 
on the order (with respect to along L, of the exponential factor of 

Existence of such an order may be ascertained as follows. We have 
F t {t) " e’ AI) h(t) — e Q{l, H' 9 {t). A strip V of limited width can certainly be found, 
.in (»i), near the right boundary of (ro) so that throughout V either 9 iL(f)£ 4 H 0 

or W(>(f)£SK£( 0 - Confine L to V. Taking account of the statement in italics, 
following (10), and noting that by hypothesis Q(t) (= £/•**. i( 0 ) >9 proper in V 
(since the proper curve /•’ may be supposed to be to the left of V) it is con¬ 
cluded that in V the exponential factor of F,(<) has a definite order with respect 
to 

By (17) 

9U(*T-a$(*)<-*«| I 
■ C$x = $x'; JRx < Six'; x in (s); x in [s) + •• • + (m)) 

where £, is small (see 17 c). Also, by (15 a), 

Ng(x')-9i<?(xKo 

{ 7 sx = %x ; 9 \x < 9 tx'; x in (s); x in (*) + ••• + (w)). 

As <R(x' — x) increases, these differences diminish sufficiently rapidly to secure, 
in virtue of (20 d), the following relation 

1 Here p ,(x) is of the same nature as /?,(x) and 
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(22 a) 




Thus 


( 23 ) 


> n W; k\-oo aa *-»»). 

^ F(<) may be considered as known and analytic in («) and such that 


(| 0 (*)|S/J W; as *-*00). 

A solution *(x) of (20 a) may be given as follows 


(24) 


i(x) = z*e-<M\F(t) 


where §™ is given by (23). Thus, this solution may be considered to be a 


l-x 

known function, 


(24a) ,(x) = *-'(l( x ) (|/?(x)|*fin to), 

analytic in ($). Using this determination of *(x) and (20) we obtain an evalua¬ 
tion of ^e’' V) h(t) as a function analytic in (9) such that, to m(A) terms 


1-* 


( 25 ) 


§ 


^h(t)-e^s(x) 


(x in (s)) 


where $(x) is the proper series of (19 b) and as k -*». 


The integer l in (22) we deBne as follows. Let e V{l) be the exponential 
factor of F,(f) (for t in V ). If s<m let X be the greatest integer such that, as 
| /1 —* oc in V, 

(25 a) 2rr(*—1)1; + 91 P(<)-* — co (t»= 30 - 


If .v = m let X be the least integer such that 

(25 b) 2xiv + 9i y(t)-* + o°. 

9-32MI. Ada malhmalua. 60. Imprim* le 3 -eptembr© 1932. 
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By (25 a). (25 b) it follows that, for k fixed, (25) holds to infinity of terms 
(Cf. § 4 ). 

As a consequence of this method of summation and of the asymptotic 
forms of the yj a (x) (*= 1, •••/’), known in (s), by (7) it follows that, for x in (5), 

(26) ~ e Qr *^. ^series. 

The second member of (26) is apparently a formal series solution; it has 
the same exponential factor and, on that account, necessarily essentially 
the same 5 -series factor as in the, originally known, formal series solution 
gQr+h {z) s r+ft [x). This construction can be effected for p = 1, ...n — T and for 
5=1, ...m. Thus, the operator L*{y) is proper. 

§ 8. Completion of the Proof of the Theorem of § 7. 

It remains to prove that L„(y) is completely proper; i. e., that the periodic 
functions, connecting the m proper matrix solutions of the system ((6); § 1), 
related to (l a; § 7), 

(1) + •-.)) 

(1,7— 1, ... n; «=» 1, ... m), 

are proper. We have 

(1») r*w - six) = (.■*'%(*)) - + , -,)) 

(* in (*); #— 1 , ...m; i,j= i, ...n); 

(lb) Y‘{x)- K-*'W6-(x), G*(i) — (yj^*)) 

(5=1, .. m - 1; G'(x + 1) = G'(x)). 

Accordingly,.for s < m, 

V(x) - y(x) Q(x) (<?(*)-(*,(*))), 

(2) 

G(x) = <?—*(*)... G'(x). 

It will be proved first that the «7<>(x) are proper periodic functions. 

Consider as given by (22; § 7). This is a function analytic in 

l-z 

(5) + (rn); in (5) it is given by (22 a; § 7). Let x be in the strip V (§ 7) 

to the left of L and, of course, not nearly congruent to L (when a position of 


625 



Analytic Theory of Singular Difference Equations. H 7 

congruency is approached L is suitably shifted). Taking account -of the fact 
that Q(x) and L[x) (see statement in italics following (io; § 7)] are proper 
(Def. 3; § 1), in V, and that F,(t) is given by (20 a; § 7), it is concluded that, 
for x in V, the exponential factor of 

r * •*=>*<*-* r,(t)dt 

L x 


is comparable with Taking account of this fact, of (21a; § 7) and 

of (21; § 7) it is observed that the exponential factor of z(x) (see (24; §7)) has 
the same property for x in V. 

Hence the function y(x), as given by (20; § 7), that is, 

04 + ,W Os**' - ; ■ £/<£»-/•) 

r-x 

has its exponential factor comparable with \e tm ^~ lx \ for x in V. This func¬ 
tion is analytic in (*)+••• + (m) (to the left of L). 

Consideration of (7 a; § 7; e = m), (7 c; § 7; e = m) and the fact that the 
Qij{x) (*’. j “ 1, ...w) are proper in V make it clear that each of the functions 

*J(*) (= yJiW) (A - 1, ... I', see (7; § 7)) 

has its exponential factor comparable with for x in V. Consequently, 

an element 

#♦„(*) (1 £n- T; (7; § 7)) 

will have the same property. 

Therefore 

( 3 ) Y‘{x) - S*(x) [x in V) 


where the elements of the matrix 6 ,# (x) are formal series with exponential factors 
comparable with \e 7!xi “''I for x in V. On the other hand, 

Y*(x) ~ S(x) 

for .r in (m) and, in particular, in V. Hence, by (2), 

G(x) ~ S-'fcc) 5 *(x) = l'{x) [x in V). 
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The exponential factors of the elements of the matrix <S“'(x) are the e Qj ‘ U) and 
hence are comparable, in V, with Consequently the same will be 

true of the exponential factors in the formal matrix T(x). 

The elements Of G(x) are analytic in an upper half plane and, by what 
precedes, they are of the asymptotic form 

(4) G(i) = (?y(*» ~ (4*«r = i*u« gfj) 

(the gij, integers; gij, constants; %x ^ q > o) 

as in (Def. 5; § 1). Thus the gifix) are proper periodic functions. 

Write for this matrix G(x) 

G(x)=G-(x) = (sC/W). 

The matrices G m, {x) ($=i,...m — 1) all consist of proper periodic functions. 
A matrix G'(x) (i£s£m — 1), occurring in (1 a) is representable as 

(4a) G'(x) = bT/* +l (xr»(*•/(*)) (s= i,... m-2). 

The determinant |(0"/'* , (r)| could not vanish since otherwise at least one of 
the set of determinants 

|G"—(z)|. |G—Ml.... |G**'(i)| 

would vanish. This, in virtue of (l b), would imply that not all the matrices 
Y°[x) (o=i,...m) are fundamental matrix solutions. Thus, the elements of the 
matrix are proper periodic functions. The same will be true for the 

elements of the product (4a). The matrix G" _, (x) is really G m ' m -'{x), and its 
elements are also proper periodic functions. It is seen, then, that the elements 
of each of the matrices G*(x) (s=i, ...m—1) are of this type. Hence L n (y) is 
completely proper, and the Theorem is proved. 

An application of Theorem I (§ 5) and of the methods of § 5 will yield 
the following Lemma. 

Lemma 10 . Suppose that the conditions of Theorem II (§ 7) hold. Assume, 
moreover , that the coefficients in L n [y) are known and of the right kind (see § 1) 
not only in (1) + ••• + (m) but also in a more extensive subregion of T, 

(1) + •• + (m) + ••• + (17) (r)>m). 

It will necessarily follow that L % (y) is completely proper in (1) +-h (17). 
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§ 9. The Fundamental Existence Theorem. 

The results of the preceding sections have prepared the way for the proof 
of the main result of this paper. This result is embodied in the Theorem. 

Fundamental Theorem. Every equation Ln(y) = o (or system), with 
coefficients of the kind specified in § 1 and known in the complete neigh¬ 
borhood of infinity, is completely proper in each of the several quadrants 
associated with the equation (or system). 1 

Proof. It is sufficient to give the proof for some quadrant. Let r be 
this quadrant. We may assume the implication of the statement in italics pre¬ 
ceding Lemma 1 (§ 2). The methods so far developed indicate how to meet the 
situation when the condition of that statement does not hold. In fact, it is 
sufficient, whenever necessary, to replace a multiple' B" curve by several cor¬ 
responding ’simple’ & curves, ’parallel' to each other. This is always possible 
since, as stated before, B 1 curves are to be considered to be determined except 
for a translation. 

Certain terms, embodied in the following definition, will be found con¬ 
venient. 

Definition 10 . A set 

which has a point of division in a region G between the T-th and r + 1 -st elements 
of the set, will be said to have a point of separation, in G , if 

( 1 = 1 , ... r, p= 1, ... n — T; x in G). 

The Qfactorization corresponding to a point of separation will be called Q*fac¬ 
torization. 

We note also the following simple fact. If 9 l#(x) 5* $R 4 S(x), interior 
to G (along some curve) 9 i Qi(x) = ® $(*)• then in 80me portion of G 

9 * 0 (*) >*«(*), 

1 A similar result will hold, for more restricted regions, when the coefficients of Ln{y) (or 
those involved in a system) are of the right kind (§ l) in certain portions of the plane only. 
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while in another part of G 

*«W< *«(*)• 

If the theorem is not true, there are equations L„[y) = o (n £ 2) of least 
order n for which the theorem fails in J\ Let 

( 1 ) L,b) = o 

be an equation of this kind. If the set of (J(x)’s 

(la) Qii*), ... &(*), 

belonging to this equation, is such that SR Q x (*) = 9 i Q\ (x) ■ ■ ■ Q' n [x) the 
equation will be seen to be completely proper in /* (see Theorem I; § 5). Hence 
not all the 9 ?$(x) (;= 1, ...n) are identical. 

Suppose now that in T the set (1 a) has a point of division. Necessarily 
there will be a point of separation in J\ By Lemma 9 (§ 6) there will be a 
factorization 

(ib) U[y)-L n - r LAy) = o 

(1 2Sr<n; x in r) 

corresponding to the point of separation. In /*, near enough to the positive 
axis of imaginaries, there is a curve F which is proper (Def. 9; § i) for the 
set (1 a).‘ Now, the operators U-r{z), L,\y) are completely proper in /\ since 
n-r<n and T<n. Theorem II (§ 7) is applicable and, consequently, L n [y) 
is completely proper in /*. Hence there can be no point of division in /\ 

Accordingly, assume that the set (1 a) has a point of division in (1)+ —h (m) 
(a subregion of T) and has no point of division in (1) + •• + (m + i).* Necessarily 
there can be only one point of division in (l) + ••• +(m). Since the 91 Qj[x) 
(j = 1, ... w) are not all identical this point of division is necessarily a point of 
separation in (i)+ - +(m). L n (y) is correspondingly ^'-factorable. Let 

( 2 ) £*(y) “ Lm—rLr(y) = o 

(1 sr<»; X in (l) + .- +(m)) 

be the corresponding factorization according to Lemma 9 (§ 6). By Lemma 3 

1 This will be true, of course, for any set of Q[x/s. 

* Incidentally, this would mean that By, = •■«= and . 
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(§ 2) there exists a curve F, in (m). which is proper for the set (ia). Suppose 
that L n -i\z), Lr(y) are both completely proper in (1) + ••• + (»i). By Theorem II 
(§ 7) it would then follow that L*(jr) is completely proper in (1)+ ••• + (»»). 
Furthermore, by Lemma 10 (§9). L„(y) would be completely proper in /*. Hence 
at least one of the above two factors is not completely proper in ( 1 ) + + (mi). 

Let it be denoted by />*. 

Consider the equation 

L'(y) = o. 

Its coefficients are known in (i)+ - +(m). Now, L'(y) is not factorable 
in (i) + "* + (m). There exists an integer m,,2£m, <m, such that L x {y) is 
^-factorable (and, of course, correspondingly factorable) in (1) + • •• + (»«,) and 
is not factorable in (1) + • + (m, + l). 1 This ^ factorization of L'{y) is 

necessarily unique in (1) + ••• + (m,). The belonging to L’(y) cannot, of 

course, be all identical. Hence this ^factorization is a Q* factorization. By 
Lemma 3 (§2) the set of ^(x)'s belonging to L'(y) will be proper (Def. 3; § 1) 
to the right of a curve F lt lying in (mi,). If both factors of L'[y) = o, known 
in (1)+ ••• + (»!,) and corresponding to the ^factorization, were completely 
proper in (1) + — f (in,) it would follow by Theorem II (§ 7) that L'{y) is 
completely proper in (1) + • + (mi,). By Lemma 10 (§ 8) L‘(y) will be completely 

proper in (1) + ••• + (m). Hence at least one of the two factors of L'(y) is not 
completely proper in (1) +••• + (m,). Denote it by L*. 

The equation 

L*(y) = o, 

with coefficients known in (1) + + (m,) is not ^-factorable in (1) + ••• + (m,). 
There exists an integer mi,, 2 S 5 mi, < m,, such that 7 >*(y) = o is 4 * factorable in 

(l) H-+ (mi,) and is not ^factorable > n (1) + ••■ + (mi, + 1). By the reasoning 

applied to L'(y) = o, previously, it is shown that of the two factors of L’(y), 
which correspond to the ^factorization (necessarily 4**factorization), at least 
one is not completely proper in ( 1 ) + • + (mi,). Denote this factor by L\ 

Continuing the indicated process, we obtain a sequence of integers ni ( 

2 ^ • • < m< < • • • < »i, < mi 

and a sequence of equations V[y) = o such that the following conditions hold. 
' An equation which is not ^ factorable might be factorable (in the usual sense'. 
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i°. U[y) is not completely proper in (l) + + (m/-,) (m 0 = m). 

2°. Ij{y) i 8 4J*f act °rable in (i) + + (mi) and is not ^-factorable in 

(i) + •■• + (m< + i). 

For a certain i (=0 we have m,- = 2. Correspondingly there will exist an 
equation L t '[y) = o with coefficients of the right kind (Cf. § i) in (i)+ • + (»>»/'-1) 

(2 < mi*— 1), ^-factorable in (i) + (2) and not ^-factorable in (1) - 4 - (2) + (3) . By 
Lemma 9 (§ 6) there will be a corresponding factorization 

L>'(y)mL t _ r L r [y) = o 

(!Sr<*;xm(l) + (2)). 

Necessarily the factors will be of order one and two. The It' curve, separating 
(1) from (2), will correspond to the two ^(x)’s belonging to the factor of order 
two; that is, the subscripts associated with this curve will be those of the two 
mentioned ^(x)’s. Hence, in view of Lemma i (§ 2), Theorem I (§ 5) will 
certainly be applicable to the factor of order two. Thus the two factors are 
completely proper in (1) - 4 - (2). Now L l ‘[y) is 4 J*-factorable in (1) - 4 - (2) and is not 
^factorable in (1) + (2) + (3). Hence, by Lemma 3 (§ 2), the set of 4 >(x)’s 
belonging to L'[y) is proper to the right of a curve F, lying in (2). By 
Theorem II (§ 7) the operator Z/(y) will be completely proper in (1) 4 - (2). 
Moreover, in virtue of Lemma 10 (§ 8), Z/(y) will be seen to be completely 
proper in ( 1 ) + ••• + (wr-i)- " r e have thus arrived at a contradiction. Thus, 
the Theorem has been proved for l\ 

For the several other quadrants (below the axis of reals, to the right of 
the axis of imaginaries, and for various ranges of arg x) the demonstration would 
be entirely analogous and structurally identical with the one just given. Thus 
the Theorem is seen to be true. 

It is essential to note that given any particular equation the preceding 
sections give actual methods for construction of those solutions and of those 
periodic functions whose existence has been established in the Fundamental 
Theorem. 

If we consider two adjacent quadrants with a common strip V along, say, 
the positive axis of imaginaries it is noted that the two proper seta of solutions 
corresponding to the two overlapping sub regions of the quadrants are connected 
by proper periodic functions. This is seen to hold because every set of ^(x)'s 
is proper in such a strip. I11 this sense, every equation L n [y) = o [or system). 
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with coefficients of the right kind in the complete neighborhood of infinity, is completely 
proper in each of the several upper and loicer half planes associated with the equation. 


§ 10. Connection between ’upper* and ’lower’ Solutions. 

Consider two adjacent quadrants /* and Q above and below the negative 
axis of reals, respectively. We shall write 

/• —(l) + (2) +••, 0-(l) + (2]+- 

(see § 3). Let K denote the combined region (1) + |i); this region extends in¬ 
definitely upwards and downwards from the negative axis of reals. It may 
happen that the negative axis of reals is a B' curve. In any case, (1) and |i] 
may be considered as overlapping along a strip //: 

(<>>o). \x\Zg>o. 

An ordering 

( 1 ) *<K4>;(x) 

will be maintained in A', if the negative axis of reals is not a B' curve. In the 
contrary case assume this ordering in the region (1) down including the nega¬ 
tive axis. The lower boundary of (1), h, will consist of a portion ot the line 
3 >x=- — q. The upper boundary of (l), h *, will consist of a portion of the line 

3x = Q. 

Let Y"(x) be a matrix solution, consisting of elements analytic in (1), such 

that 

(1 a) Y-(x) = (yj(*)) - S(x) = {cQi'"s,j(x )) 

(i,j= 1, ... n; x in (1)). 

Let l' 1 (;r) be a matrix solution, with elements analytic in (I), such that 
(1 b) r(x) = (yj > (x))- 5 (x). (x in [1]) 

The matrix V (x) (= (p t j (x))) of periodic functions, defined by the relation 

(2) r-w= r(r)/'(x), 

consists of elements analytic in H. By (1 a) and (1 b) it follows that 

10 — 32511. Acta mathrmatica. SO. ImprimA I© 3 scptrmbr© 1932. 
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(2 a) 


ipnix)) = (&*** + -jjr)). 


where k can be made arbitrarily great while the Qij{x) are bounded in 11 . 
Along the negative axis of reals the ordering (i) holds, that is 

si oii(x) ^ o. y< *') 


Hence 9\(?;,(x) (; < 0 *3 non increasing as |x|-*oc along the negative axis. In 
virtue of (2 a) and the periodicity of the pij[x) we shall have 

p, j(x) = Jim = ° 

(3x = 3 x = o; 91 x > $ x'; x — x\ integer) 


for 1 >j; on the other hand, 

pu[x) — lim pn(x') “ 1 

Since the pij (x) are analytic it would follow that 

(2 b) PiA*) m o 




(*>», 
1, ... n). 


Definition 11 . A matrix (h,j(x)) will be termed a half matrix if hi Ax) - o 
(» >j), while hi <(x) ■ l (1=1, ... n). 

Use the transformation 


*i« 

Write 

(2 c) P(*)-0« ■(*,(#)). 

The gij(z) are analytic for ^ ^ «*•*. Letting c, denote the circle \t\ = 

and e, denote the circle |*| —•*"* the following will hold for any func¬ 
tion g[e) analytic for t~ 7H * ^ |*| S? e* n *. 

g(z) = a(e) + 6(r), 
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where c t is described in the counter clockwise direction and c, is described in 
the clockwise direction. The function a(z) will be analytic interior to c t , while 
b(z) will be analytic exterior to c,. 

We seek to determine half matrices B(z) (= (6<j(c))) and A(c)(= ((<*/>(*))), 
with the bij(z) analytic exterior to ct and the atj(z) analytic interior to c*, so 
that 

( 4 ) P(*)« 0 (r)-BMAM; 

here the radius of the circle e\ is to be slightly greater than while the 

radius of the circle c* is to be slightly less than e *"*. 

From (4) we have 

(4 a) g tJ (r) = £ bn {z) aij {z ). 

!■! 

n n 

Let i >j. In the second member bn(z) = 0 for » > A so that 2 = In the 

*-l imi 

latter sum the subscripts of the axj(z) satisfy the inequalities 




thus these axj(z) are zero. Hence (4 a) is satisfied for i > j. Fori—jwe 
should have 

(4 b) 1 — 2 a 'i W (*«=!,... n). 

1-1 

The equations (4 b) are obviously satisfied since 6n(r) — O (i > A), au(z) = o 
(A > t) while hi[z) = a t t(z) = !. It remains to consider (4 a) for t < j. These 
equations take the form 

(4 c) g tJ (f) = 2 to* W aij W (t <j\ *,i— I, ... n). 

1-1 

They will be grouped, for 0= 1,2,.... as follows 


( 5 ) 


*+• 

9 ut +0 w=2 fa w 1*0 w (* ——0). 

imi 


An equation 


9 w +* W = a u*i W + 1 W 
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of the set (5; 0= 1) will be solved by letting 


isa) 




# X _ 1 f fU+l(Q*t 

«* 


to. m W 


2 _ ftoioJOJC 

v~i J c-* 


271 


r * 

These are functions of the desired type. The function 
(5 b) ffi.t+t W ■ 3 M — to.i+i (*) au i.itt W 

is consequently known and analytic in a closed Laurent ring (c],c\) slightly in¬ 
terior to the ring (c,, c f ). The set (5; 0=2) can be written in the form 

(5 c) gl <♦* (*) = at. (+7 (f) + to. m W 

(* = I, ... n-2). 


Solutions of this set, of the desired kind, are obtained by writing 

(5 d) «,,♦,«-■ f 9li± i ffl'C , 

2nV- 1 J 5 “* 

f S 


2nV - , J 


Suppose that desired solutions of (5) have been obtained for 0= 1,2,... m— 1, 
with reference to a Laurent ring (C“‘. ). slightly interior to the ring 

(c,,c,). As a consequence, the function 

|6 > »r».w s »(.«♦.(*)+ 

+ to.<+a (*)a<+*.r+»(*) + ••• + to.<+«-i 


will be known and analytic in a closed ring (c?, c?), interior to (c?" 1 , c? -1 ). 
The set (5; a=m) can then be written in the form 


(6 a) 


(• — 1,... n — tn) 
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and can be solved by writing 


(6 b) 


„ ; ( • r.~m*c 


. /• #<+■ (CM S 


Thus the matrix equation (4) may be considered solved as required. The 
elements of A[r) will be analytic for t and those of I?(r) will be ana¬ 

lytic for |;| £ e~ lHe (e > c > o). In virtue of (6 b) the a tJ (x) (= a,j(z)) are of 
period unity, analytic for — e and of the form 

( 7 ) a tJ (x) — *)* a*i + • • • 

(«U, integer, £0; 3x£ — c) 

where the second member is a convergent series in positive powers of 
Similarly, 

(7 a) = " 

{fits, integer, £ i; 3* £ c) 

where the second member is a series in positive powers of e~* ! ' v ~ l *. Conse¬ 
quently, the a tj(x) are proper in an upper half plane, and the flij{x) are proper 
in a lower half plane'. 1 That is, in suitable regions 

(8) «,,(*) 

[at), integer, ^ o; 3x ^ — c; a ti = o; a, # < *= 1 ; i,j = i, ... w), 

(8 a) 

ifrj, integer, ^ 1; 3x £ c; (in = o; fit = 1; i,j= 1, ... n). 

Besides, (a,j(x)), (flij (x)) are half matrices. It can be easily shown that («fj(x)) _l 
(B(( ftu (x))) also is a half matrix, with elements analytic for 3x^ — c and pro¬ 
per in an upper half plane; 


The meaning of the latter term is made obvious by analogy to (Def. 5: § l), which was 


given for an upper half plane. 
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(gb) 

(aij, integer; 3 z^ — c; au = o\ a*t= i). 

Examine the situation when 

( 9 ) 9* «.(*)—• = ««,(*) 

(a<j; x on negative axis of reals). 

In this case, by (2 a), it would follow that along the negative axis, and hence 
identically, 

(9 a ) /*•(*) = 0 (* < m; a £ », m sSj). 

It will follow that gj-tjU) ( b /hjW) = o so that, by (5 a), 

&i-ij(*) = o. 

If o£j— 2, by (9a), fl_jj(r) — O. The relations 

“ «m.jM = o 

would imply that 0>L*j(x) (see 5 b) is zero. This, in turn, would mean that 

<b-+.i W — W - o (see (5 d)). 

By induction, it could be shown without difhculty that (9) implies the relations 

(10) a 0tJ (x) = a 0 +,j(x) — • • • — oj—i.j (z) — o, 

p*.j(x) = P**u(x) = ••• = A-IJ (*) ■ O. 

These relations would necessarily follow if in place of (9) it were merely assumed 
that 

(10 a) JR &>-» (z), 9 ? Qj.j-*(x ) t ... JR <&.(*) <S k 

along the negative axis of reals. 

Now, for s = 1, 2,.. 

i+*-i 

( 11 ) 2 x M <+« (*) + (*) + a t , t+ , (z) = o 

i-m 

(»=!,... n —s). 

Suppose (10) holus. By (11; $= 1) 
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(i I a) «/./♦! (*) = — ai.i+i (x) (i — I, ... w — i) 

so that, since aj-ij(x) = o, we have a>_ij(i) = o. By (■ l; .v=^2), for i = 

= I, ... w — 2, 

«(*) + + «/. r*i (*)«/♦ 1.1+* (*) —O- 

Thus 

flHjW + “)-*.)(*) + “U-iW a y-uW = O 

so that, if in (bo) o£j — 2, necessarily o>_ij(jr) = o. By induction it follows 
easily that the first line in (io) implies that 

(1 2) a.. f (x) = a«+ij(x) ==••• = a>_ij (i) - O. 

TVims, (ioa) implies (io) awd (12). 

Consider the matrix Z(x) (= *<>(*)) defined by the relations 

(13) r-(*)(£„(*)) = r(x){fi,j(x)) - Z(x). 

This is apparently a matrix solution. Its elements are analytic in A'(= (1) +(i|). 
This follows by (2) and in virtue of the relation (4), 

For x in K and 3 * — e, on account of (1 a) and (8 b), 

(13a) *>){x) = jfciM + 1 + r tJ (x) J. 

(i3b) r ti [x) = 2 e^ u,+t - 1 * ( t, * (x) + («, + ■••). 

Here denotes J|j (z) with the power series factors terminated after A terms 
(X- sufficiently large); for a fixed k the V ■,(*) are bounded in K for 3 x£~c, 
while X-, can be made arbitrarily great by taking k sufficiently great. The 9 ?&>(x) 
(A = i, ... j — 1) are non increasing, in virtue of (1), as x-*oo along the nega¬ 
tive axis of reals. In fact, at least for \x\ sufficiently large, the inequalities (1) 
would imply that, for x in K in a strip 

(14) o £%x^(l (o < el), 

wa have 

(M a) <?, (*) <J N Q t (x) £ --3 3 J Q n (x); 
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moreover, (14) will hold, in K , in a wider strip 
(14b) -d<k*xZd 

if (1) is maintained throughout K. 

If 9 \ Qj-i.j(x) approaches — « along the negative axis, the same will be 
true of the (x) (A = I, . J- 2). and, more generally, this would be the 

situation in a strip (14) or (14 b), as the case may be. Thus, if SR Qj.j-i (a;) is 
not bounded along the negative axis of reals, all the exponential factors in (13 b) 
will diminish rapidly enough, as x-*oo in K in the strip (14). to insure the 
asymptotic form 

( 15 ) e QjW stj{x) 

in this strip. If 9 t Qj.j—i (a:) is not bounded along the negative axis of reals and 
(1) is maintained in K, (15) will hold in the strip (14 b). 

Suppose that the functions 

» Qj .*->(*). » ... a Qj.Ax) 

are bounded along the negative axis of reals. In view of the statement in ita- 
lies, following (12), it is clear that (12) will hold. Thus, if <7-* 1, 

r t j(x) = o. 

On the other hand, if a > 1 

(16) r, i (x)=2'*- 

2-1 

The exponential factors in the latter sum may be supposed to approach zero in 
the strip (14 a) (or in the strip (14 b)). Consequently the asymptotic relation (15) 
will certainly hold in the strip (14), in A\ It will be maintained in a strip (14 b) 
if (1) holds throughout K. 

Consider all those curves which lie in T and satisfy equations 

( 17 ) M\Q,j(z) + 2 txV— i a tj x\ = o (* < j) 

formed for all such * and j that 9 i Qn{x) approaches —00 along the negative 
axis of reals. Let B\ denote the lowest of these curves. In the case of no 
curves (17) in l we let B " denote the right boundary of V. If the limiting 
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direction of B“ is that of the negative axis of reals, this curve will be neces¬ 
sarily of the form 

(17 a) v = h(-uY' + --- (x = u + h >0; 1 >e, >0). 

Let B u denote some curve in r, to the left of B* lt with a limiting direction 
slightly different from that of if is not of the form (17 a). In the con¬ 
trary case let B u satisfy an equation 

(17 b) v = h(—uY (e,>e>o). 

In any case B u is to be in r. 

We shall have 

(18) Z(x)~S(x) 

in the closed region K u consisting of the part of K bounded below by the ne¬ 
gative axis of reals and bounded above by the lower one of the two curves B' 
(upper boundary of K), B 

If the ordering (1) is maintained in K the relation 

*«■)- rwfonW) 

may be used to show that (18) holds also in a closed region K 1 consisting of 
the part of K bounded above by the negative axis and bounded below by a 
curve B 1 , lying in 12 , or by the lower boundary of K. The curve B 1 is to be 
considered as determined with reference to the set of equations 

2nV^i Pi)X ) — o (1 < j) 

just as IP has been specified on the basis of the equations (17). 

Consequently the following has been made evident. Given matrix solu¬ 
tions F u (x), Y‘{x) proper in (1) and (1), respectively, there exists a matrix solution 
Z[x), with elements analytic in K (“(1)+ (i|), such that Z{x)= 7 “(x)i*(x) = 
= Y l {x)P‘(x). Here P" (x), i*(x) are half matrices consisting of periodic functions 
proper in an upper and lower half plane, respectively. Moreover, Z(x) may be so 
constructed that Z (x) - S(x) in a sub region of K, extending from the negative axis 
upwards (or downwards) at least to a curve of form v = h(—uY. In the latter 
equation h is positive (or negative) while 1 > e > o. In particular, if the negative 
axis is not If curve, Z (x) ~ S (x) in a sub-region of K (or in K itself ), extending 
from the negative axis upwards and downwards at least to curves of the form 

11 - 32611 . Ada maUunxatica. 60 . Imprim6 le 3 «epl«mbre 1932 . 
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v = h(-uY (h real, * o; i>e>o). 

It is clear that analogous facta will hold with reference to quadrants, above 
and below, the positive axis, to the right of the axis of imaginaries. 


§ ii. The Converse Theorem. 

In this section the problem, inverse to the one solved in the Fundamental 
Theorem, will be considered. The result, in this connection, will be embodied 
in the theorem. 

Theorem III. Let 

(1) eO’Wi.W, e^s t (x) .*«*(■) 

be a linearly independent set of formal series where the Q } (x) and the formal 
s series $j(x) (/ = /,... n) are of the same general description as might occur in con¬ 
nection with a difference equation of order n. Let R lt R t , ...R», R$+u ••• 
the set of consecutive regions, formed with reference to the set of Qj{x){j t =l • • • ») 
on the preceding pages. Let two such consecutive regions have at least a strip 
df say, unit width in common. 

Let there be associated with each region R, (s - l, 2 ,...) a set of n functions 

( 2 ) rt(*). riM. •••£(*) 

analytic in R $ , and such that 

( 2 a ) ~ •> (*) 0 '=l. n\ x in R,). 

Assume that, for s = I, 2, .. 

(2 b) yj(*) = 2 tf +l W Pi} *' W 0 .. 

1-1 

where the p\j* x {x) are of period one. 

It will necessarily follow that there exists a difference equation of order n 

(3) L n [y) = y(x + n) + a,(x)y(x + »- 1) + ••• + a*(x)y (x) = o, 

with coefficients of the same kind in the complete neighborhood of infinity as postu- 


641 



Analytic Theory of Singular Difference Equations. 


83 

lated in § i, possessing the following properties. The series (i) mil constitute a set 
of formal solutions of (3). Each set (2) will he a fundamental set of solutions 
of ( 3 )- 

Proof. Form the determinant 

y(x+ n), tr + »), ... (x + n) 

tj[x + n — 1), [x + n — 1), ... f n [x + n — 1) 

• • • ••• ••• ••• 

y(*). rtto -..jcW 

■ Wy(* + »)+<*.(*)y(* +»—!) + ■•• -f d*(x)y (x). 

On account of (2 a) and of the linear independence of the series (i), 

d\(x)*o % <£{*) v* o. 

The coefficients d](x) {j '» o, 1,... n) are analytic in R,. They will be asymp¬ 
totic, for x in R lt to the formal series obtained by replacing the elements in 
the determinants, expressing these coefficients, by corresponding formal series. 
Thus write 

(4 a) dj (x) - dj (x) 

O' —o, 1,... n; x in R,) 

where 6 j(x) is an s-series. Now the logarithms in the s-series of factors in 
(1) and the Qj{x) enter in such a way that it is possible to combine the co¬ 
lumns in the mentioned determinants so that the logarithms will not enter in 
the dj(x) (j = o, I, ... n). On the other hand, if we write 

l!l <h-!S ... 

we shall have flj(x) analytic in R, (for |x| sufficiently great) and 

(5 b) Oj (x) ** aj (x) 

(j= 1, ... n; x in R,- t a\{x) * o; I, ... si) 

where the formal series o/(x) are in negative powers of x* (p, positive integer) 
with, possibly, a few positive powers present. 
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Noting that 


(-!)*<$(*) 


y(x + ») 

<T, (x + " -i + 0 

j rAx + n-j- 0 

y,(x) 


y.(x + n) 


and combining column., having (2 b) (with s replaced by . - 1) in view, we con 
elude that 


( 6 ) 


d‘ ) {x) = p‘- , ' , {x)d' ) -'(x) 

0 = o, 2,3...p*” ,# (x) periodic, * o. 


If R, and R,-x have a strip along the negative axis of reals in common, neces¬ 
sarily is analytic near the real axis. Otherwise, R, and /?,_> will have 

a strip in common extending indefinitely upwards (or downwards) from the axis 
of reals. In this case 


in virtue of periodicity, is analytic in an upper (or lower) half plane. 
By (5) and (6) 

(7) flj(x) = ap*(x) 0=1. 


A function a } (x), defined as the analytic extension of, say, aj(x), will be analytic 
in virtue of (7) in each of the regions 


^1. • • • 


Moreover, the asymptotic relations 

(8) aj(x)~aj(x) 0=1. •••") 

will be maintained in these regions. 

The equation 

(9) L n {y) = y(x + w) + a,(x)y(.T + n - 1) + • • • + a»(x)y(x) = o 


will be actually of order n and with coefficients of the required type. Each of 
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the sets (2) will be a fundamental set of solutions of (9). This follows from the 
form of the determinant expression for the operator L*[y). In view of (2 a) and 
the definition of the a t (x) (*=1, ...n) in terms of the yj(x) the series (i) will 
formally satisfy (9) in R lt R it ... The proof of the Theorem is thus completed. 


§ 12. The Related Rlemann Problem. 

Let T(x) (= (e Qi ‘ x, 6j(x))) denote a matrix whose elements are those of a 
formal matrix S{x) (— (e^ (xl ty(x))) with the power series factors terminated after 
a number of terms. Suppose that S(x) has the general character of a matrix 
of formal solutions which might occur in connection with a difference system of 
order n, of the type indicated in § 1. The set of Q(x)‘s, 

(1) <M*) 

defines a sequence of consecutive regions 

(2) (l*l>?>o), 

as indicated on the preceding pages. For definiteness suppose that 7 f, is the 
lower one of the regions constituting the quadrant 

— e SargxS^j + t. 

The regions (2) will cover (outside the circle |x| = e) the extended complex plane 
over the range 

(2 a) - t £ argx£ 2np + e, 

where p is a suitable integer depending on the 4 M X )- a particular region 

R, is considered it is essential to keep in view the corresponding range of argx. 
As we proceed in the counter clockwise direction, let B' denote the last one of 
the boundaries of R, encountered. In i?, a fixed determination of T(x) will be 
supposed as given. As a consequence, T(x) will be known uniquely in each of 
the regions (2). For x in R m , in the neighborhood of B” 1 (a portion of a line 
3 x = c>o), the Qj(x) (j= 1, ...n) will correspondingly be the same as in 2 ?,. 
On the other hand, the t tj (x) may be different. The latter situation will take 


644 



George D. Birkhoff and W. J. Trjitzinsky. 

Hb 

place if a factor x l > has r tJ not equal to an integral multiple of i/p or if there 

are logarithms present in some of the *;(*)- 

Assume that associated with B ' we have a matrix 

(s) (5=1,2,...), 

consisting of elements of period one. Suppose that i*‘ +, (s) has the general 
character of a matrix of periodic functions which might occur, as indicated on 
the preceding pages, in connection with a different system of order n, formally 
satisfied by S(x). More specifically, the following is assumed. 

The ft ,+l (x) (*,y= I. ... ») are proper, unless B 1 is a portion of a line parallel 
to the axis of reals. In the latter case they are analytic in a strip along the axis 
of reals. Along B‘ [and within a limited distance of B'), in the vicinity ofx = 00, 

(3 a ) = (flj/ +, (x)) = Tt*)/* •♦'(*) ~ /, 

while the derivatives of all orders of the matrix A'^+'ix) are asymptotic, along B 1 , 
to zero. 

The following theorem will be proved. 

Theorem 4 . Let T[x), the corresponding regions (2) and the matrices (3) of 
periodic functions, 

(5=1,2,...), 

he given. Conceiving these penodic functions assume the statement in italics pre¬ 
ceding this theorem. 

There exist then matrices 

(4) r,(*), Y,{x), . . Y.(z), ... 

such that 

(5) Y.(x) = y,.,(x)P* + *(x) (5=!, 2,...). 

Furthermore, for x in B$, the elements of Y,[x) will be analytic (5ate at x = oo) 
and |y,(x)|^o, while 

(5 a) r.l*)~S(x) (x in R,). 

The exponential factors e Qlx) , occurring in the elements of the formal matnx S(x ), 
are coirespondingly the same as in the matrix T(x). 
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Proof. For the purpose at hand the following theorem established by 
Birkhoff 1 will be used. 

'Let Cg, ... C r be r simple closed curves in the extended complex plane. 
Let A t {x), ... A r [x) be matrices of functions defined and indefinitely differentiable 
along C lt ... C r respectively, analytic save at a finite number of points of these 
curves and of determinant not zero. If furthermore at any point of intersection 
of C a , Cp the matrices /i«(x), Ap(x) are such that the formal derivatives of all 
orders of the matrix 

(6) A m [x)AfM - A p [x)Ux) 

vanish, there exists a matrix <P(x) with the following properties: 

(1) each element of <Z>(x) is analytic except along C lt ... C, and at an 
arbitrary point x = a where the elements may become infinite to finite order; 
|0>(x)| nowhere vanishes save possibly at x = o; 

(2) the elements of (P(x) are continuous and indefinitely differentiable along 
each curve C t from either side, analytic from either side save at points of inter¬ 
section of the curves, or at those points where an element of A t {x) fails to be 
analytic, or at x = e; if a lies on a curve Ct, the matrix (x —a)'A ( (x) (or x“'./l,(x) 
if a = 00] is indefinitely differentiable along Ct for a suitable /; 

(3) if a + and — side of each curve Ct are chosen, then, 

(7) lim <Z>(x) = | lira <D(x)) At(x!\ (t — I.... r) 

—t —>T 

where the approach to the arbitrary point Xt of Ct is along the + and — side 
respectively.’ 

The curves C, may be subjected to weaker restrictions and may extend to 
infinity. Thus, for instance C » (*=i,2,...), may be defined as consisting of 
B * up to a point on the circle |x| = e; inside of this circle a portion of C, will 
consist of a curve I 1 , through x = 0, joining the mentioned point on the rim 
of the circle with another suitable point on the rim of the circle; from the latter 
point on, C, will consist of a curve B * extending to infinity (and analytic in 
every finite part of the plane); moreover, the component parts B\I\ B * of C, 
will be supposed so joined that C, is a simple curve with a continuously turning 

1 Cf. Ill. Methods developed with a view of application to the classical Riemann problem, 
con be found in papers by Hilbert (Gott. Nachr., 1905. pp. 307—338) and Plemelj (Monatsch. f. 
Math. u. Pbya., 1908, pp. 205—246). 
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tangent; furthermore, if the limiting direction of B• is a. that of B' is to be 
considered as a, + n; finally, the several curves C, are to have points in common 
only at x = o and at x= 

The matrices AM may be defined as follows. Along B’ 

( 8 ) AM-A^+Mx 

along B‘ 

(8 a) AM- 1 . 

We take q sufficiently great. On the other hand, along /' the elements of AM 
are defined so that the conditions of the above theorem of Birkhoff hold with 
respect to AM- Thus, except at infinity, the elements of AM arc analytic 
along C$ for \r\ > q; for x on C, 

\AM\*o. 


The condition stated with respect to (6) will certainly hold at x = oo. In the 
neighborhood of x-o the elements of the matrices . 4 ,(x) may be so determined 
that this condition will hold at x — o as well. 1 The point x * u will be sup¬ 
posed to lie interior the circle |x| = p. 

There will exist a matrix <J>(x), with elements analytic for |x| > q except 
along 7J\7J S ,... and except at x = oo, such that |(/>(x)|**o (|x|>p). Along 
each curve B‘ the elements of <Z>(x) will be analytic from either side (for |x| > Q, 
and excepting x = cc). Moreover, for x, on B\ 


( 9 ) 


lim 0>(x) = (lira 0>(x)) 


Here A' ,,+1 (x) is given by (3 a). The + side of B' will be taken corresponding 
to an approach from the interior of 7 ?,. The asymptotic form, at x = oe, of 
<D[x) will be the same along both sides of B' since A'*" rl (x,) ~ 7 . Thus, there 
exist matrices 


(9 a) 


CMx). tf,(x).... 


1 Determination of /t«'x) as stated, involves an approximation problem a solution of which 
liad been given by A. Besikovritsch (Matheiuatischc Zeitsrhrift, Band 21, llcft 1/2, 1924. Another 
solution had been given by Trjitzinsky. tCf. forthcoming paper ‘Approximation by analytic func¬ 
tions with prescribed derivatives' to apper in the Am. Jour, of Math.). 
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such that the elements of U,(x) (s=i,2,...) are analytic (save at z = oe) in It, 
and such that | U,(x) \ ^ o (|a:|>p) in It,. Moreover, 

(9 b) U.[x)= U,+x[x)A”+'{x) {s= 

It is evident, in the light of the papers referred to above and treating Riemann 
problems, that the elements of U,(x) (f=l,2,...) behave at infinity essentially 
as rational functions. For x in It, 

(9 c) U,(x) - U(x) = [u,j(x)) [s = i, 2,...). 

Whether the elements of the formal matrix U(x) can be made to be formal 
series is left, for the present, undecided. 

Write 

(«o) Y,{x) = U.(x)T(x). 

Then (5) will hold. Moreover, for x in R,, 

U.[x)T[x) ~ (■y(*))(<!^%(*)) 

= Sto 2 «.i to 

i 

It follows immediately that the theorem is true. 

In a future paper one of the present authors (Trjitzinsky) proposes to 
develop the analytic theory of linear difference equations with rational coef¬ 
ficients which is a case of particular interest in which more special results can 
be obtained. Here it would be desirable to find those 'principal solutions' which 
stand out because of their peculiar analytic simplicity and to formulate the 
corresponding Riemann problem. The same author proposes also to develop in 
an analogous manner the analytic theory of ^difference equations and of the 
ordinary linear differential equations. 


12 — 32611. Aria malhetnalira. 60. Iraprlm* I* 19 novfmbre 1932. 
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NOTE ON A CANONICAL FORM FOR THE LINEA R 
q-DIFFERENCE SYSTEM 

By George D. Birkhoff and Paul E. Guenther 
Department op Mathematics, Harvard University 
Communicated March 10 , 1941 

The modern theory of ordinary linear ^-difference equations may be 
considered definitely to have evolved with the appearance of papers by 
R. D. Carmichael 1 and G. D. Birkhoff 2 announcing what was then regarded 
as a general theory. Proof of the existence of a full complement of formal 
solutions in the truly general case and an investigation of their convergence 
was later given by C. R. Adams.* More recently W. J. Trjitzinsky 4 has 
discovered a full complement of analytic solutions and has developed 
certain of their asymptotic properties. 

As a point of departure either the single linear ^-difference equation of 
the nth order or the equivalent ^-difference system may be employed. It 
is the purpose of this note to announce the existence and indicate the 
derivation of a canonical form for the linear ^-difference system of the 
nth order analogous to that found by Birkhoff for the linear differential 
system. 5 Consider the matrix form of the ^-difference system 

Y(qx) = A(x)Y(x), \q\ 4 = 1 . ( 1 ) 

where A (x) is analytic or has a pole at x * «, and \A (x) j = = 0 . From the 
work of Adams cited above, such a system (1) has always a full omple- 
ment of n formal solutions, which in the typical case that certain rational 
numbers mi. ms. .... m.« are all distinct may be written (after a simple pre¬ 
liminary transformation) in the form 

PM = (*<K*))(«^ ( " _,) *"*«). 

where the s i; (.y) are power series of the form £ s ijk x ~ k > ($i/( »)) = <«■>) 

*«o 

In this case the matrix A{x) in ( 1 ) has the special form 
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Watjix)), 

where the a i} (x) — 5 ,, vanish to arbitrarily high order at x = <*>. Without 
loss of generality it may be assumed that the m are all integers and such 
that w > ... > n H ^ 0, and that \q\ > 1. 

To the system ( 1 ) is related an adjoint system 

Z(qx) = Z{x)A-'{x) ( 2 ) 

which has formal solutions 

Z(x) = («<,(*)) = 

where the /,,(*) are series in descending powers of x, and the kth of the n 

basic sets of solutions consists of the elements z k .. z kH . But from the 

above-mentioned work of Adams it follows that the formal solutions corre¬ 
sponding to the least value of m, i.e., — aij, are actual solutions as well. 
Thus the elements *u(x), .... z Jm ( x ) constitute an analytic solution of 
( 2 ) in the neighborhood of x = ®. Then for • Z(x ) = (s,^) (a matrix 
having all rows alike) it follows that 

•Z(qx)Y{qx) - -Z(x)A-'(x)A(x)Y(x) = -Z(x)Y(x) 

so that 

jE*u(a*)»(8*) = E %>(*)»(*) = #>(*). 

where p(x) is a ^-periodic function. The change of variable 

yi = jthi (*)%(*) = ? 2 *y y, (3) 

yj = y>. j- 2.« 

then transforms the system (1) into a system (dropping the bars on the y's 
and a’s ) 

y>(?*) = x’YyM 

«(«*) = o«(*)yi(*) + - • - + ojO’.W (4) 


y-(q*) = a,i(*)yi(*) + ... + a.„y,(x). 

This new system has n independent sets of formal solutions with m,’s and 
s distributed as in the original system. Furthermore the matrix (a^x)), 
i, j = 2, ..., n is rational at x = ® and of determinant not identically 
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zero. The corresponding matrix of independent formal solutions is 
obtained from that of ( 4 ) by deleting the first row and column. 

In the case n = 2 the system ( 4 ) becomes (with p's and r’s in inverted 
order) 

yiiqx) = x'Y’yito < 5 ) 

yiiqx) = a 7 i(x)yi(x) + a 72 {x)y J {x). 

Let y 2 (x) — q iK W's(x) in the solution for which y, = 0 . Then the 
change of variable yi = % y* = s(x)yt converts ( 5 ) into 

yi(qx) - xT 4 'y x (x) (6) 

y*(qx) = (hi(x)yi(x) + xf'j'yti*)- 

Now by means of a transformation y x = y,, y, = a(x)yi + y*. where a(x) = 

^dx~ i is to be determined, it is found that the corresponding coefficient 

in the transformed system is 

flai(x) = ct{qx)y?'<f' - a(x)x'Y' + a, ,(.v). (7) 

This implies the following relationship between coefficients of ith powers 
of x: 

s hi - - c,+ m <r + (« 

For convenience let u = a it (hu = a,*, pi — pi " p, and replace i + P 7 
by i. Then (8) becomes 

5.- w = ^ J” + o,-.,. (9) 


For simplicity in this report consider in detail only the case p = 1 . Take 
to arbitrarily large and set c* = 0 . Then a^ m can be made zero by choos¬ 
ing as dictated by ( 9 ). This process can be repeated until 

Co is determined. There remains only the relation = Coq n to be satis¬ 
fied, which is taken as the definition of a_ Mt . The coefficients c,,_ 2 . • • •. 
Co thus determined are given by the formula 

, + O-lMr,-.) -jr, 

Ci = q 'a i+ 1 -„ + IU0) 

i“2 


for t = 0 , .... Jo — 2 . Now let i 0 —► ®. The series in ( 10 ) plainly is 
convergent for every i ^ 0 . Thus Co, C\, Cj, ... are so determined that by 
the transformation y x = y u y* = a(x)yi + y?, n 7 l (x) may be reduced, in 
the case p — 1, to the simple form 
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an(x) = C<4'x?\ 

For the more detailed but equally straightforward ease where p > 1 the 
resulting form is 

5«(*) = x'V ’t'ctqx)- 1 . 

i-0 

The system (1) of the second order may consequently be transformed into 
the canonical system 

yi(qx) = x'YW*) 

yriqx) = Ec.W'Wr) + (11) 

i-0 

A canonical form for the system of order three is obtained by first apply¬ 
ing the corresponding transformation ( 3 ). The solutions of the resulting 
system of form ( 4 ) for which y x = 0 satisfy a second order system in y J( 
yj which may be treated as outlined above for the general second order 
system. Then by means of a first transformation of the form y x = y Xl 
5*j = 0 (x)y x + y t , y, = y, where 0 (x) is determined by the method resulting 

p-\ 

from ( 9 ) the coefficient a 7 X (x) may be reduced to x'V'C £ where 

<■1 

P - m - Also by a transformation y, = y x , y t = yt, y« ■ y(x)y x + yj, 
asiW niay be similarly reduced. Continuing in this manner by an 
induction on the order of the system there is obtained in all cases a canonical 
form for the system of order n striking in its simplicity and symmetry. For 
(1) of order n, the equivalent system may be written in the form 

y x (qx) = ***•/">»(*) 

yfa) = Wet X (x)y x (x) + a f'-'tT-'ytix) (12) 


yjqx) = + ••• + ,(x)y.-,(*) + x M '<f'y n (x) 

ail 

where the Cy(x) are polynomials in \/x of the form c,.(x) = £ c iik (qx)~ k 

*-o 

and ay = Hn + i-j ~~ Fn+i-i — 1. A detailed treatment of all cases 
will be found in Guenther’s doctoral dissertation, and this together with 
further developments in a joint paper which the authors expect to offer 
to Acta Mathematica. 

Up to the present time the theory of linear g-difference equations has 
lagged noticeably behind the sister theories of linear difference and differen¬ 
tial equations. In the opinion of the authors the use of the canonical 
system, as formulated above in a special case, is destined to carry the 
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theory of g-difference equations to a comparable degree of completeness. 
This program includes in particular the complete theory of the convergence 
and divergence of formal series, the explicit determination of the essential 
transcendental invariants (constants in the canonical form), the inverse 
Riemann theory both for the neighborhood of * =» ® and in the complete 
plane (case of rational coefficients), explicit integral representation of the 
solutions, and finally the definition of 5-sigma periodic matrices, so far 
defined essentially only in the case n — 1. Because of its extensiveness 
this material cannot be presented here. 

1 Carmichael. R. D., "The General Theory of Linear ^-Difference Equations.” Am. 
Jour. Math., 34, 147-168 (1912). 

* Birkhoff. G. D., "The Generalized Riemann Problem...,” Proc. Am. Acad., 49, 
521-568 (1913). 

1 Adams, C. R.. "On the Linear Ordinary g-Difference Equation.” Ann. Math., Ser. 
II. 30, No. 2. 195-205 (1929). 

« Trjitzinsky, W. J., "Analytic Theory of Linear g-Difference Equations.” Ada 
Mathematica, 61, 1-38 (1933). 

• Birkhoff. G. D.. "Singular Points of Ordinary Linear Differential Equations,” 
Trans. Am. Math. Soc.. 10, 436-170 (1909). 
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QDELQOES TH£0REMES 

SUR LE MODVEMENT DES SYSTEMES DYNAMIQUES, 
Par M.-G. D. Birkiioff. 


Parmi les mouvements d un systeine dynamiquc, il pent en 
exister certains ajraot la propriele remarquablc de representer le 
mouvement tout entier avec tel dcgre d'approximation qu on 
desire, pendant tout intervalle de temps egal a T, ou T vane sett¬ 
lement avec le degre de l’approxiination. Dans le present 
Memoire, ces mouvements sonl appeles mouvements recurrents; 
ils forment unc extension naturelle des mouvements periodiques. 

Du point de \ue que j’adopte, un mouvement stable est un 
mouvement qui, a parlir d un certain moment et ulllrieurement, 
n'approclie jamais indrfiniment pres dc certaines positions singu- 
lieres; je demontre qujil exisle necessairement un ou plusicurs 
mouvements recurrents dans le voisinage infinitesimal d un mou¬ 
vement stable. 

Les mouvements recurrent* se rangenten deux types, Pun d'eux 
(Slant representable explicitcment au moyende fonctions continues 
et periodiques des variables qu’elles conticnnent. Ce premier type 
est appele type continu et semble comprendre tous les mouve¬ 
ments recurrent* dont (’existence a etc etablie. Je monlrc, par la 
construction d’un cxemple, que, au inoins lorsqu’on n’assujellit 
pas les equations diflerenlielles definissant le mouvement a £trc 
nnalvtiques, il exisle des mouvements recurrent qui ne sont ni 
periodiques, ni dans le voisinage infinitesimal d'aucun mouvement 
periodique, et qui sont d une nature particuliere telle que la 
designation de mouvements recurrents disco/itinus leur semble 
appropriate. 

Des reclierclies incompletes me conduisent a croire que, dans le 
cas dc /t = 3, il existe aussi necessairement des mouvements 
recurrent* discontinus dans les cas analytiques, et que la struc¬ 
ture de ces mouvements est essenliellement semblable a celle du 
mouvement presente dans fexemple. 

B. i 
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Ce Me moire a ete lu pour la premiere fois a la reunion d’ete de 
I’American Mathematical Society en 1909. 

I. — Representation gkometmque. 

Un systeme dynamique, dans une tres large acception, pent 
£tre consider*; comme defini par un systeme quelconque dYqua- 
tions dilIV*rentielles ordinaires du premier ordre : 

dx x dx , _ _ dx n __ 

(i) x7“ xT-XT"* 1 ' 

ou X„ . • ., X„ sont des fonclions donnees, reclles et uniformes 
de x,, . . ., x„, analytiques par rapport a ces variables, et ou t est 
la variable independante. Les variables x„ sont les coor- 

donnecs du mouveinent, et / indique le temps. 

Nous convenons de represcnlcr les coordonnees du mouveinent 
comme les coordonnees rectangulaires d’un point de I’espace a n 
dimensions. Tout mouveinent possible est defini par n Equations 

ou les fonclions analytiques constituent une solution 

de (1). Les Equations (9.) sont aussi represcnt^es par une courbe 
de l’espace a n dimensions, a moins que ..., ; n ne soienl 
toutes constantes; alors (2) sont representees par un seul point. 

II fatal maintenant rappeler quelques-uns des resultats fonda- 
mentaux et bien connus, relatifs a un systeme difierentiel du 

type( | ). .... 

Soil S fensemble des points pour lesqucls au morns une des 
fonclions X,, . . ., X /# cesse dVtre analytiquc, et soil E l'ensemble 
des points, ne faisant pas parlie de S, pour lesquels les equations 

(3) X| = o, .... X/, = o 

sont satisfaites siiuultanement. 

Par tout point P, cxterieur a S et E, il passe une courbe (P) et 
une seule, donnee par une infinite simple de mouveinents (2), 
savoir par.ceux qu'on obtient en remplacant t par t c, c elanl 
une constante. A deux points quelconques A et B sur une seule et 
meme courbe, correspond un inlervalle de temps, qui est le temps 
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n^cessaire aux coordonnees du mouvement pour passer de celles 
de A a celles de B. 

Les points E representent les mouvemenls a coordonnees con- 
slantes satisfaisant a (a), Lorsque t croit ou dlcroit, le point mobile 
sur une courbe (I*) ne pent tcndre vers un point-liinite E dans 
un temps (ini, car cela contredirait le theoreme fondamental 
d’existence pour un systeme tel que (i). 

Le point mobile sur une courbe (P) pent cependant tendre vers 
un point-limite S dans un temps (ini. Mais on fera la convention 
que la courbe nc s'etend pas jusqu a cc point. 

Nous entendrons par le voisinage (c) de S les points qui sont, 
soil a une distance inferieure a s d un point de S, soil a une dis¬ 
tance supericure d | de 1’origine des coordonnees. 

La courbe pa6santen un point P, exterieur a Sou E, est suscep¬ 
tible d’extension pour lim / = -+- oc ou lim t = — co, a moins que 
lorsque / croit ou decroil, respectivement, depuis la valeur ini- 
tiale t 0 , vers une limite (inie 7, le point mobile sur la courbe (V) 
ne tende a venir et a rester dans tout voisinage (s) de S, si petit que 
soil £. Ce fail est Evident, puisqu'eit dehors d’un voisinage (e) 
donne de S, l’extension est possible d une maniere uniforme pour 
un seul et mc l me intervalle de temps. 

Enlin, si t est pris comme parametre de la courbe y on peut 
dire que la variation continue d’un point P (exterieur a S) pro- 
duit une variation continue de la courbe (P). Avec plus de preci¬ 
sion, nous avons l’^nonce suivant : 

ThCoreme auxiliairk. — Si P n est pas un point de S, et si le 
mouvement , t/ui pour t = t 0 est represents par P, est suscep¬ 
tible d’extension pour il rst possible de c/ioisir unc 

quantile 0 positive assez petite y pour que tout mouvement qui y 
pout t = t Q , est represents par un point P' de distanced P infe¬ 
rieure d 0 , soil aussi susceptible d'extension pour -:<t<Q y et 
soil en outre tel que , durant cet intervalle , la distance entre les 
points correspondants de la courbe (P) et de la courbe (P f ) 
n'excede pas une quantile positive arbitrairc t. 

Ceci est ('application aux equations (1) d’un theoreme bien 
connu coneernant la variation d’une solution d’equations dilleren- 
tielles ordinaires, lorsqu’on fait varier les conditions initiales. 
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J[. — Stability f.t instability. 

Nous pouvons maintenant deduire certains theoremes generaux 
simples, concernant 1e mouvement d un systerae dynamiquc tel 
que(i). 

A celte fin, nous classerons les mouvements possibles d’apres 
la definition suivante : un mouvement est dit positivement 
(ou ntgativement) stable , s’il existe un nombre e tel que 
pour t > t 0 (on t< t 0 ), les points representatifs du mouvement 
restent ext^rieurs a quelque voisinage (e) de S. Sinon, le mouve- 
menl est appeie positivement (ou negativement) instable. Cette 
definition impose au mouvement le minimum de restrictions 
necessaires a assurer la slabilite, dans son sens mathemalique 
strict. 

L’exlension d’un mouvement stable est possible pour toutc 
valeur de t > t 0 (on t < t 0 ), puisque l'cxtension d’un mouvement 
ecsse seulement d’etre possible lorsque, t approchant d’unc ccr- 
taine valeur finieT, le point representatif vientet reste a l’interieur 
de tout voisinage (c) assign^ de S. 

Les types les plus simples de mouvement stable sont fournis 
par le cas du repos, represente par un scul point E, et par celui 
du mouvement periodique, represente par une courbe ferim'c. 

Tous les points desquels les points de la courbe representative 
d’un mouvement s’approchent indefiniinent pour lim/=4-a> 
(ou lim/= — co), seront appeles les points-limites omega (ou 
alpha) du mouvement; ils forment, bien entendu, un ensemble 
ferine. Les points de la courbe elle-meme ne sont pas ncccssaire- 
ment dans l’une ou l’autre des classes de points-limites. 

Dans un Memoire important, M. Hadamard (')aappcle l’en- 
semble des points-limites, le domainc du mouvement. En particu- 
lier, il a prouve que l’enscmble des points-limites d’un mouvement 
stable represente un ensemble de mouvements stables, contenant 
au moins un mouvement (*). 


(*) Sur les trajectoires en Dynamique (Journal de Mathematiques, 3* siric, 
t. III. 1897 , p. 33■ *38 7 ). 

(*) Loc. cit., p. 38^. 
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Ce fait constitue la inajeure partie du llicoremc suivant : 

Th£oremf. I. — Les points-liniiics omega d'un mouvement 
posilivement stable represen lent un ensemble de mouvements , 
posilivement et negative men t stables , desquels le mouvement 
donna s'approc/tc uniformement quand le temps croit, e'est- 
a-dirc que la distance entre le point representant le mou¬ 
vement ct Vensemble, des points-Umites tend vers o pour 
lim / = -4- co. 


Demonstration. — Soil L un point-limite omega d’un mouve- 
meni posilivement stable donne. Alors, puisque le mouvement 
donne est stable, L est exterieur a un certain voisinage fixe de S, 
et la courbe (I.) petit Olre etendue dans les deux sens a partirde I,, 
pour un intervallc de temps uniformement grand pour tons les 
points I.,. Mais, cn vertu du theoremc atixiliairc, toute courbe qui 
passe en des points Q suffisaiumcnt voisins de L, reste a une 
distance inferieure a s de la courbe (L) pendant le m£ine temps. 
Puisquc la courbe donnre representative du mouvement passe cn 
de tels points Q a pres tout instant fixe il est clair que Parc de U 
courbe (L), correspondant a cet intervallc de temps uniforme, se 
compose de points-limites omega. 

En repetont ce raisonnement, on voit que tons les points de la. 
courbe ( L) sont des points-limites omega. 

De plus, puisqu’aucun dcs points-limites ne peut £tre a Tinte- 
rieur du voisinage fixe de S, ectte courbe (L) pent etre etendue 
pour toute valeur tin temps, ct est posilivement et negativement 
stable. 

Pour completer la demonstration du tlicoreme I, il n’v a plus- 
qu*a montrer que, lorsque t croit, le point mobile sur la courbe 
representant le mouvement donne, tend uniformement vers les 
points-limites (L). Supposons qu’il n'en soil pas ainsi; alors if 
existe une quantile positive d et une suite d'instants t. 2 , . 

(lim t„ = -f- oo), pour lcsquels la distance entre le point repre- 
sentatif du mouvement donne et lout point-limite omega est au 
inoins egale a d. La suite correspondante de points aura au inoins 
un point limite L', qui sera distant d’au inoins d de tout point L. 
Malgrc eela, \J doit etre lui-meme un point L, puisqu’il possede 
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l a propriety fondamentale de ces points. Ceci est impossible, par 
consequent, le theoreme est neccssaireinenl vrai. 

A cause de la svmetrie impliquee, nous ne considerons pas, dans 
le tlieoreme qui vient d’etre demontre, les mouvements negative- 
ment stables, pas plus que nous ne le ferons dans les theoremes 
suivants. 

II sera commode d’appeler les mouvements represents par une 
courbe composeede points-limites om£ga(ou alpha), mouvements 
Unities omega (ou alpha). 

Tiikohemk II. — Slant donne un motive merit positivement 
instable : 

i° On bien, il est settlement susceptible d'extension pour 
t ses points representatifs tendant a venir et rester dans 
tout vo is in age ( t) de S, pour lim / = t; 

2° Ou bien , il est susceptible d'une extension indeftnie 
quand t croft, et il est tel que , pour tout nombre positif z, on 
peut trouver un intcrvalle de temps T, assez grand pour que, 
au moins une J'ois dans tout intcrvalle T pour t^.t 0 , le mouve- 
nicnt soil represents par un point interieur au voisinage (e) 
tie S; 

3 " Ou bien, il est susceptible d'une extension indeftnie 
quand t ct oil, et possede au moins un mouvement limitc omega, 
positivement et negativement stable. 

Demonstration. — La demonstration se fait aislment. 

En premier lieu, si l’cxtension n’est possible que pour * < 
nous avons evidemment le cas (i°). S’il n'en est pas ainsi, suppo- 
sons que la condition du cas (2 0 ) ne soil pas satisfaite, et cher- 
chons a demontrer que nous avons alors le cas ( 3 °). 

Dans ccs conditions, il existera une quantite positive s et une 
suite d’intervalles de temps indeOniment croissants (*,,*,), 
(* 2 , l’ 2 ), . . ., durant lesquels tous les points representant le mouve- 
ment instable donne sont en dehors du voisinage (s) de S. 

Soient alors I*,, I*,, les points correspondant aux milieux 
des intervalles de temps de la suite. Cette suite de points a au 
moins un point-limite P, qui est au moins a la distance z de tout 
point de S. Par un raisonnement semblable a celui qui a iU fait 
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dans la demonstration de la premiere partie du theoreme 1, on 
voit que la courbe (P) peut etre etendue pour toute valeur de *, 
et reste en dehors du voisinage (s) de S. Brievement indiquee, la 
raison en cst que les courbes (Pa) (A* = i, 2, . . .) sont situles en 
dehors du voisinage (e) de S pour des intervalles de temps positifs 
et negatifs indefiniment croissants avec A', landis que certains des 
points Pa tendcnl vers P. Par suite, en vertu du theoreme auxi- 
liaire, la courbe (P» a les propriety indiquees. Ainsi le mouve- 
ment M satisfait a la condition ( 3 °), puisque le inouvement(P) est 
Un inouvement limite de M, positivement et negativement stable. 


III. — Molvement RECURRENT. 

On vient de voir que tout inouvement M, positivement 011 
negativement stable, possede un ensemble fermi de mouveinents 
1 iinites M', positivement et negativement stables. 11 s ensuit imine- 
diatement que lYnsemble des mouveinents limites alpha et omega 
de tout mouvemenl de M', ou bien coincide avec M\ 011 bien est 
un sous-ensemble de M' (•). 

Definition. — Tout ensemble ferine M' de inouxements posi- 
tivement et negativement stables, tel que tout mouvemenl de M 
ad met M' pour son ensemble de mouveinents limites alpha, ainsi 
que pour son ensemble de mouveinents limites omega, sera appcle 
un ensemble minimal; et tout inouvement de M' sera appcle un 
inouvement recurrent . 

Par definition, un inouvement recurrent M est stable. De plus, 
tout point de la courbe representative est a la fois un point-limite 
alpha et un point-limite omega, car autrement l'ensemble des 
mouvements limites de M ne contiendrait pas M, contrairement a 
la definition. Ce fait signifie que le mouvemenl approchc arbitrai- 
rement pres d’une ijuelconque de scs positions infiniment sou vent 
pour 1 i 111 t = — x et lim t =-h sc; e’est-a-dire que le inouvement 
est stable au sens de Poisson. 

Toutefois. eetlc propriete ne earacterisc pas seulement les niou- 


(■) Cf. IIadamaud. toe. cit.. |>. 
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vements ^currents. Par exemple, on peut construire, pour n = 3 , 
un systeme dynamique tel qu’il existe un mouvement positivement 
et n^gativement stable M, dont les points-limites alpha et omega 
constituent tout l’intlrieur et la surface d’un tore. Un tel mouve- 
menl est stable au sens de Poisson, mais il ne peut Sire recurrent 
au sens defini ci-dessus; en eflet, tout mouvement dont la courbe 
est situee sur la surface du tore appartient a M', mais ses mouve- 
ments limites forment seulement unc partie de M'. 

Les exemples les plus simples de mouvement recurrent sonl 
les cas du repos et du mouvement plriodique; dans ces deux cas, M' 
coincide avec M et se compose d’un seul mouvement. 

Dans tous les autres cas, M' contient une infinite de mouvcments 
dont le nombre cardinal est celui du continu. En eflet, en un 
point P de la courbe representative d’un mouvement recurrent 
qui n’est pas de ces types simples, construisons un petit element 
plan normal a la direction de la tangente en ce point, et conside- 
rons l’ensemble ferme de points forme par 1*infinite des intersec¬ 
tions de toutes les courbes de M' avec cet element. Tout point de 
cet ensemble, comme par exemple P, est un point-limite des 
points de l’ensemble, puisque c cst un point-limite alpha et omega 
de l’ensemble infini correspondant au mouvement (P). Par suite, 
Pcnsemblc de points est parfaite t a la puissance du continu. Mais 
chaque mouvement (P) correspond seulement a un ensemble 
denombrable de tels points. Puisque le nombre cardinal des mou- 
vements de M # ne peut exceder celui du continu, il lui est neces- 
sairement egal. 

La propriete caracteristique des mouvements recurrents est 
donnee par le th6oreme suivant : 

Th£oiieme 111 . — La condition necessaire et suffisante pour 
qu'un mouvement positivement et negativement stable M soit 
un mouvement recurrent est que pour lout nombre positif e, si 
petit quit soit , il exisle un inlervalle de temps T, assez grand 
pour que Varc de la courbe representative correspondant a 
tout intervalle egaldT ait des points distants de moins de z de 
n'importe quel point de la courbe tout entiere. 

Demonstration. — La condition est necessaire. En effet, si 
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elle n’est pas satisfaitc par quelque mouvcment recurrent M, il 
sera possible de clioisir une succession de points I*,, P a , . . ., sur 
la courbe representant le mouvement, el une succession d’inter- 
valles (/,, t\ ), (l«, 4), . . . , correspondant a des periodes de temps 
indefiniment croissantes lelles que, durant I’intervalle (/*, f' A )» les 
points de la courbe ne vienncnt pas a une distance moindrc que z 
(s etant une quantile positive) du point P* de la courbe. 

Soit Qa le point de l arc correspondant au milieu dc l’intcrvalle 
(tky 4 ). L’ensemble Qa a un point-limite Q, obtenu en prenant la 
limite d’un ensemble partiel Q* t , Qx s , ..., convcnablement cboisi 
dans l'ensemble donne, et I’ensemble partiel P X| , P i( , ... a au 
moins un point-limite P. Par suite, cn fixant seulement noire atten¬ 
tion sur une suite de la serie A ,, A* a , .... pour laquelle l\ tend vers P, 
nous pouvons ecrire simullanemcnt: limQA = Q et lim 1 **= P. Lcs 
points Q et P seront naturcllement des points-limitcs L de M, el 
tout point de la courbe (Q) ne sera pas distant du point Pdc moins 
de s. Ceci ivsulte du fait que la courbe <(>a) nc vienl pas a une 
distance de Pa inferieure a z pendant des intervalles de temps 
positifs et negatifs indefiniment croissants nvee A, et que, par 
consequent, la courbe limite complete ne vient jamais a une 
distance dc I* inferieure a e. en vertu du tlieoreine auxiliaire. 

L'ensemble des mouvements limites du mouvcment (Q) dc M' 
est done seulement un sous-cnscmble dc M', puisipi'il nc peut 
contenir Ic mouvcment (P). Mais ceci contredit le fait que M est 
un mouvcment recurrent. Par suite, la condition est necessaire. 

On \o‘il aussi aisement que la condition imposee a M est suffi- 
santc. 

Car supposons celte condition remplie. D’abord, il est clair que 
l’ensemble des mouvements limites alpha, ou celui des mouvements 
limites omega, consistera, d’apres le theoreme I, en un ensemble 
ferme de mouvements positivement et negativement stables M'; 
de plus, en vertu dc la condition qui est supposee satisfaitc, eba- 
cun de ces deux ensembles doit contenir M, et, par consequent, les 
deux ensembles coincident. 

Si M n’est pas un mouvcment recurrent, il exislera dans l’en- 
semble des mouvements limites M', un mouvement M t , avec un 
ensemble de mouvements limites omega (ou alpha), qui est un sous- 
ensemble de M'. Soit P un point-limite omega dc M et non de M ( , 
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de telle fi.con <|ue 1* se irouve a une distance exccdant quelqtm 
constante positive 7, de tout point de M,. A certa.ns instants, 

telsque lim /„ = + *. les points correspondants du mon- 
vement M viendront a une distance inferieure a une quantile petite 
quelconque 3 d’un point Q de M„ et, par suite, cont.nueront a el. e 
a une distance inferieure ii lout nombrc e des points de M,, durant 
tout intervalle de temps donne. si grand qu’il soil, cn vertu du 
theoreme auxiliaire. 


En clioisissanl s < nous voyons, par consequent, qu'on pent 
trouver des intervalles de temps indefmimenl croissants, durant 
lesquels les points de la courbe (M) ne viennent pas a une distance 
inferieure a 2 de 1 ’un dc ses points-limites omega, savoir de P. Par 

consequent, la condition n’est pas salisfaile, a moms que le mou- 
vement ne soil recurrent. C'est-a-dire que la condition est suffi- 


saute. 


IV. — MoUVF.UENTS STABI.KS F.T MOUVKMKXTS lUiCt'RRENTS. 

J 1 y a une relation etroite entre les mouvements stables el les 
mouvements ^currents d un systeine dynamique. 

T»i£oreme IV. — L f ensemble des mouvements limitcs omega M', 
dc tout mouvement positivement stable M, contlent au moms 
un mouvement recurrent. 

Demonstration. — Designons par S a les regions fermecs en 
lesquelles l’espace a n dimensions est divise par des systemes de 
plans equidistants, paralleles aux plans de coordonnees, compre- 
nant ces plans, et situes a la distance d les tins des autres. Consi- 
derons les regions S, ayanl chacune pour volume Tunite. La 
courbe representant le mouvement M a des points dans un nombre 
fini de ces regions, puisque M reste dans une portion finie de l*es- 
pace. 

Soil M, un mouvement de M' dont la courbe correspondante 
entre dans le moindre nombre possible des regions S,. Conside- 
rons a part ces regions S,, et divisons chacune d'elles en 2" re¬ 
gions S,, de volume Tous les mouvements liiniles omega de M, 
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ont leurs courbes qui restent entierement dans ces regions par- 
tielles; parmi ces mouvements, il en existe un, M a , dans M' t (et, par 
suite, dans M'), dont la courbeentre dans le moindre nombre pos¬ 
sible des regions S,. En continuant de celte facon, nous pouvons 

choisir des mouvements M,, M a , M a , . . dans M', en considerant 
les regions successives S,, S , S,, .... 

i \ 

Soit P,, P a , . . . n’iinporte quel syst^me de points situtfs respec- 
tivement sur les courbes repr^sentant M,, M a , . . ., et soit P un 
point-limite de cette suite de points. Alors P est naturellement 
un point-limite de M, puisque l’ensemble de ces poinls-limites 
est ferm<$; et la courbe passant en P a la proprtete d'etre situee 
dans cliacun des systemes de moindre nombre de regions S,, 
S,, . . ., puisque tous les points de la courbe (P*) sont dans les 
systemes de moindre nombre de regions S,, S,, . . ., S, . 

R6ciproquement, il est clair qu’un point quelconque apparte- 
nant a tons les systemes de regions choisies est un point-limite du 
mouvement (P). Autrement, la courbe (P) reste entierement dans 
cbaque systeme de regions S*, S,, . . ., mais ne vient pas arbitrai- 

rement pres dc quelque point Q conlenu a la fois dans toutes. Par 
suite, on voit que, si k est pris suffisamment grand, la courbe (P) 
n’entrera pas dans la region particuliere qui contient mais, 

d’apres la definition des regions choisies S^, la courbe repr^sen- 

tant le mouvement^P; doit entrerdans cliaque region St , puisque 

la courbe reste entierement dans ces regions et que le mouvement 
fait partie de M'. Ainsi, il se produirait une contradiction, si la 
propriete rtfciproque n’etait pas vraie. 

Nous pouvons maintenant d^montrer immediatement que le 
mouvement (P) est recurrent. 

En premier lieu, les ensembles de mouvements limites alpha et 
omega du mouvement (P) doivent coincider. Car, autrement, il 
existe un mouvement limite alpha ou om^ga, dont les points-limites 
sont un sous-ensemble des points-limites du mouvement (P). Par 
consequent, la courbe correspondante reste entierement dans les 
regions choisies S^, mais n’entre pas dans toutes ces regions, bien 
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que le mouvement appartienne a M'; ceci est impossible. Secon- 
dement, par un raisonnement analogue, nous concluons imm^dia- 
tement que les mouvements limites du mouvement ( P) forment un 
ensemble minimal. Par suite, le mouvement (P) est recurrent. 

Theoreme V. — Tout mouvement positivement stable M a la 
propriete que , pour tout nombre positif e, on peut c/ioisir un 
nombre positif T asses grand pour que , sur tout arc de la 
courbc M correspondant a un intervalle egal a T, apt cs 
Vinstant initial t = t 0y tons les points ne rcstent pas d une 
distance au moins egale d c des courbes de tons les mouve¬ 
ments limites ricurrents de M. 

Demonstration. — Si le theoreme n’etait pas vrai, nous pour- 
rions choisir une suite d’intervalles de temps indefiniment crois¬ 
sants (/„/;), ...» durant lesquels aucun point de la 

courbe representant M ne serait a une distance inferieure a une 
cerlaine valeur c des points de tout mouvement limite recurrent 
de M. Soil P un point-limile des points P,, P„ .... correspon¬ 
dant aux milieux des intervalles (*i, l,), (tj, tj), 

Tout a fait com me plus haul, nous en deduisons immediate- 
ment I’existencc d’une courbe passant par P, positivcmcnl et nega- 
tivement stable, telle qu'auctin de ses points ne soil a une distance 
inferieure a s des points representatifs de tout mouvement limite 
recurrent de M, bien que les points de la courbe ( P) soient tons 
des points-liinites omega du mouvement M; ccci est absurde, 
puisqu’au moins un mouvement limite du mouvement (P) est 
recurrent, d’apres le theoreme IV. 

Ce llieoremc conduit a conjecturer quon peut representer un 
mouvement stable donne, aussi exacteinent qu on le desire, au 
moyen d’un ensemble de mouvements recurrents, et de mouve¬ 
ments qui leur soient asymptoliques. 

V. — Classification des mouvements recurrents. 

Quels sont les types de mouvements recurrents? Dans le cas 
de n = i, l’unique mouvement de cette sorte est donne par x, = c ; 
dans le cas de n = 2, on a seulement, outre le cas x t = c,, x 2 = c 2 , 
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le cas du mouvement periodique 

*t= P/(e , 1 c v^**.T) (* = !,a), t = J(^{e^^^)dx 

(ou A‘ t est un nombre r^el positif, et t un parametre), car les 
courbes representatives d'un mouvement stable dans le plan sont, 
ou ferm^es, ou asymptotiques a une courbe ferm^e ('). 

Dans le cas d'un plus grand nombre n de dimensions, on a n 
especes analogues de mouvements, savoir : 


xt=ci (i = i, 2, .... n), 

x t = (* = i, 2, ...,/»), t= J % Q(e^/= r ^^)dx t 

x i= P(* = !,», ...,n), 


ou les quantiles A* a , . . sont reelles, positives et incom- 

mensurables: mais on n’a aucunc raison de croire que cette classi¬ 
fication epuise tous les cas possibles. 

Cc type de mouvement recurrent peul £tre appele mouvement 
recurrent continu et a la propriete.que ses points-limites rem- 
plissent un continu d’un certain nombre de dimensions. Tous les 
mouvements ^currents ne possldant pas cette propriety peuvent 
4 tre appeles discontinus. 

Dans les problemes de Dynamique complement integrables, 
c’est seulement le premier type qui se rencontre. Dans les cas non 
integrables, je n’ai pu decouvrir un exemple dans lequel on ait 
montre l'existence d’un mouvement du deuxieme type. 

II est inlercssant de considerer, sous ce rapport, le cas de n = 3 
dans lequel les trajectoires de la Dynamique sont les geodesiques 
d’une surface a courbure totale negative, sans singularites.. 
M. Hadaniard ( a ) a traite ce cas. II divisc les geodesiques en trois 
categories : 

i° Celles qui sont fermees ou asymptotiques a des geodesiques 
ferm^es; 


(‘) Cf. Poincare, Sur les courbes definies par les equations diJJerentielles 
(3* panic) (Journal de Mathematiques, 4* scrie, i885, p. 228 ). 

(’) Les surfaces a courbures opposees el leurs lignes geodesiques (Journal 
de Mathematiques, 5* serie, t. IV, 1898 , p. 27 - 73 ). 
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2° Celles qui s’eloignent sur les nappes infinies de la surface ; 

Les geodesiques restanles, qui restent dans les regions times 
de la surface et qui sont en nombre inlini. 

Le inouvement correspondant a une geodesique de cette troi- 
sienie catdgorie esl stable, et, par consequent, panni les geode¬ 
siques representatives des inouveinents limiles, il existe au moins 
une geodesique correspondant a un inouvement recurrent. Cette 
geodesique ne peut elre de la seconde calegorie, ni de la premiere 
et asymptolique a une geodesique fermee, puisqu aucune de ces 
possibilities ne peut corresponds aim inouvement recurrent. Par 
suite, ou bien toule geodesique de la troisieme calegorie tend a 
s’approclier infiniment souvent, indefiniment pres en position et 
direction (\'unc certaine geodesique fermee, pour s eloigner ebaque 
fois de nouveau lorsque le temps continue a croitrc, ou bien il 
existe un inouvement recurrent donne par une geodesique dc la 
troisieme calegorie. 

S'il existe un inouvement recurrent de cette troisieme calegorie, 
les recbercbes de M. Hadamard tendent a montrer qu’il doit etre 
du type discontinu. 

l)ans les paragraphed suivants, je donnerai un exemple duquel 
il resulie que, an moins lorsqu'on ecarte ('hypothesc que 
X,, ..., sont analytiques (' ), il existe des mouvements 
recurrents discontinus y pour un systemc dVqtiations 1i), dans le 
cas de n = 3 . Ceci ne permet pas de donner une conclusion polit¬ 
ies problemes de Dynnmique acluellement etudies. 

VI. — Sun I.ES K.NSK'I RI.KS PARKAITS Ql l XE SONT DENSES NUI.I.K PART. 

Comine preliminaire a la construction d un inouvement recur¬ 
rent discontinu, il nous esl necessaire d’etablir un certain type de 
transformation d’un ensemble parfait, nulle part dense, en lui- 
menic. 

Prenons un tel ensemble de points P sur un cercle de rayon i, 
sur lequel ebaque point esl determine par une variable angulaire 0. 
/Vous alio ns montrer qu il existe une transformation continue 


(') Les definitions employees anicrieuicment doiveut 01 re convenablemcnl 
inodifices. 
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de cet ensemble en lui-meme , qui conserve I'ordre sur le 
cerclc , el qui est analogue a tine rotation d’ un angle incom¬ 
mensurable avec 

Inscrivons. en ordre dc grandeur non croissante, les inter- 
valles I*, I* 2 , 1*3 Pi, ... du cercle, a Pinterieur desquels il ny a pas 
de point de l’ensemble P, mais dont les points extremes P,, 1*2, ... 
appartiennent a Pensemble. Puis, cboisissons un nombre irration- 
nel /, et ecrivons les nombres 



tels que dans le n irme groupe de nombres entrent seulement les 
nombres dc la forme ±j±: *- t (p, q 9 r, s etant des entiers posi- 

tifs on nuls dont la soinme cst qui ne different pas par des 
nombres entiers de ceux des groupes deja ecrits. Nous repr^sen- 
terons cette serie de nombres par les points mesures par Parc •{/ 
d’un cercle (*{/) ayant une circonference de longueur i, sur lequel 
ces nombres donnent unc s^rie dc points dislincts. 

Pour elfectuer une correspondance entre ces nombres sur le 
cercle (•}) et les intervalles du cercle (0), nous proc^derons comme 
il suit : nous cboisissons le nombre o pour corresponds a P,P a ; 
le nombre t qui vient ensuitc correspondra a P 3 P 4 ; le nombre 
correspondra au premier des intervalles suivants P/P/ +l , qui est tel 
que les intervalles P, P a , P 3 P», P/P/+* soient dans le inline ordre 
sur le cercle (ft) que les points o, /, sur le cercle (•{/); de m£mc, 

le nombre -t correspondra au premier inlervalle P>Py + . , non d£ja 
cboisi, qui est tel que les intervalles P,P a . P.P., P. P, + „ PyP> + , 
soient dans le inline ordre que o, t , ^ /, et ainsi de suite. 

Ainsi une correspondance ordonnee entre les nombres et tous 
les intervalles est ctablie. En effet, si tous les intervalles n’elaient 
pas choisis, il y aurait un premier intervalle non cboisi P«P«+'i 
mais cet intervalle est certaincment le plus grand de ceux qui sont 
situes entre deux certains intervalles cboisis a pres que les prece¬ 
dents Pont ele, et, par consequent, il sera cboisi lorsqu’il appa- 
raitra un nombre "l entre les nombres correspondant a ces deux 
intervalles. 
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Lc point o, sur le cercle ('I), joint a un point quelconque qui 
n’est pas de la forme ± - ± j elablil une separation des points 
de cette forme en deux ensembles ordonncs, ceux d’une classc 
precedant ceux de l ? autre. Sur le cercle (8), nous avons d'unc facon 
correspondante une separation en deux ensembles d’intervalles 
p 4 P /+l , relies de la m£me maniere, et, par suite, definissant un 
point-limite unique de 1’ensemble parfail. Ainsi une correspon¬ 
dence ordonnee et biunivoque est etablic entre, d’une part, tous les 
points du cercle (<{/) et, d’autre part, les couples de points P,P l+ i, 
joints aux points P reslants. 

La transformation -t transforme Tensemble dcs points 


du cercle (tj/) en lui-meme, et transforme ainsi Tensemble parfait 
du cercle (8) en lui-meme, de maniere que tout couple P,P, + i, se 
transforme en un certain couple determine PyPy+M ct que tout 
autre point de Tensemble parfait primilif vient en un point 
transform^ qui n est pas P,, P 2 . 

Cette transformation conserve Tordre circulaire sur le cercle (0), 
puisqu'elle le conserve sur le cercle (•!/). Ainsi, si nous regardons P, 
comine allant en Py, et P l+ , comme allant en nous avons 

defini une transformation continue biunivoque de Tensemble par¬ 
fait en lui-m^me, qui conserve Tordre dcs points P et qui est du 
type indique an d£but. 

En outre, si nous convenons qu’un point du cercle (0) situ£ 
entre I\- et P, + , est transforme dans le point semblableinent place 
entre Py et Py+i, nous avons defini une transformation continue 
biunivoque 8' = /(8) de tons les points du cercle (8), conservant 
Tordre, et transformant les points P d un ensemble parfait, nulle 
part dense, en eux-memes. 

Cette transformation a de plus la propricte que les puissances 
de la transformation et de son inverse amenent tout point dc Ten¬ 
semble dans le voisinage infinitesimal de n iinporle quel autre point 
de Tensemble. Ce dernier fait devient evident, si Ton se rappelle 
que, sur le cercle (•}), un point •!/„ vient dans un ensemble de 
points partout dense, au moyen de la transformation 

-+- t repetee m fois. 
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VII. — EXBMPLR D’UN MOUVEMHNT RECURRENT DISCONTINU. 

Nous pouvons maintenant chercher a obtenir un systeme dyna- 
mique qui admelle un mouvcment recurrent discontinu. Prenons 
une ligne droite l fixe, dans I’espace a trois dimensions dans 
lequel les courbes representatives du mouvement doivent 6tre 
d^finies, et soil o I'angle qu’un plan variable contenant l fait avec 
un plan fixe quclconque contenant l. Soient o et o' les traces sur 
ce plan variable d’un cerclc dont le centre est un point de /, et 
qui est situe dans un plan perpcndiculaire a l. 

Construisons, dans ce plan variable, le systeme des cercles 
coaxiaux admeltant / comme axe radical et dont o et o’ sont les 
cercles de rayon nul, et construisons aussi le systeme des cercles 
orthogonaux qui passent par o et o'. 

Un point P du plan variable, non situe sur /, est ddtermind par 
la distance rdeoau point ou le cerclc du premier systeme passant 
en P rencontre oo' prolong^e, et par I’angle 9, sous lequel le cercle 
du deuxieme systeme passant par le point rencontre la ligne 
droite oo'. 

Si ces coordonndes r, 9, ? sont prises comme coordonnles 
cylindriques dans un second espace (r, 9, cp), tous les points du 
premier, sauf ceux de /, correspondent d une manure biunivoque 
et continue aux points de la region ind^finie situde entre les plans 
paralleles cp = o et <p = 2 n. 

Nous allons d'ahord definir la courbe representative d’un mou- 
vament recurrent discontinu, et a cel eflet nous ferons usage de 
l’espace (r, 9, <p). 

Choisissons une transformation 9' = f( 9) du type indique dans 
le precedent paragraphe. Cette transformation peut etre regardee 
comme transformant tout point P du cercle r = i, ^ = o, sur le 
cylindre / == i, en un point P', distinct de P sur le cercle. Si main- 
tcnant nous developpons la portion de la surface du cylindre qui 
se trouve entre » = o ct o = ait, sur une bande indefinie dans un 
plan i 9, «p ), nous pouvons construire un ensemble de courbes PRQ, 
en joignant chaque point P sur In ligne tp = o an point Q de la 
ligne v = situe vis-a-vis du point P', la ligne PRQ etaiil com- 
posec de deux arcs egaux PR, RQ; de deux cercles egaux, ortho- 
13 . 
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gonaux respectivemcnt a ? = o el ? = 2 i, el Ungents en R. Les 
courbes PRQ remplissent la bande, et par suite la surface du 
cylindre; en tout point il passe une de ces courbes, et la direction 
de la tangente a ces courbes varie continuement. 

Dans l’espace primitif, les courbes PRQ repr^sentent un 
ensemble de courbes recouvrant une fois la surface du tore r = i 
et ayant partout une tangente qui varie continuement, puisque 
toutes les courbes PRQ ont une seule et meme direction de tan¬ 
gente pour (p = o et f = 27t. 

Gonsid^rons une courbe PRQ, dans laquelU P correspond a un 
point de I’ensemble parfait nulle part dense que 0' = /(B) laisse 
invariant. Le point Q est par construction oppose a P', transform^ 
de P, et par suite P'R'Q' repr^sente un arc sur le tore qui est la 
continuation de Parc represente par PRQ. Par une suite de sem- 
blables raisonnements, nous concluons que la courbe complete 
correspondante 2 sur le tore est represents par la totalis des 
arcs p(*>R<*>QC«>, ou P (/n se deduit de P par la r£p6tition de la 
transformation Q' = /(0) et de son inverse. 

Mais nous avons vu que celte transformation ou son inverse 
amdne P dans le voisinage infinitesimal de tout point de l’en- 
semble parfait invariant. Par suite si la courbe 2 est dSrile avec 
une vitesse qui fait croitre v avec une vitesse constante v, l’en- 
semble des points-limites alpha et omega du mouvement obtenu 
se compose de toutes les courbes represenlees par un arc PRQ, ou P 
est un point de l’ensemble parfait, et, en particulier, il comprcndra 
tous les points de 2 elle-meme. 

Puisque cet ensemble de points-limites est le meme quelle que 
soil la courbe 2 que nous choisissons, l’ensemble de mouvements 
est minimal, et 2 correspond a un mouvement recurrent dis¬ 
co n tin u. 

11 reste a d^finir un ensemble continu de mouvements qui com- 
prenne l’ensemble 2. 

Pour obtenir tous ces mouvements, nous relournons a l’espace 
(r, 0, <p) et nous d^pla^ons un plan <p = cde<p = oacp = 2 tc, avec 
une vitesse uniforme v. Le mouvement d’un point de ce plan sera 
parfaitement determine, si Ton convient : 

i° Que la ligne radiale 0 = c de ce point dans le plan mobile 
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ne cesse pas de rencontrer une seule et m£me courbe PRQ sur le 
cylindre r = i ; 

2° Que sa vitesse radiale ^ est donnde par r[a 2 -f-(i— r) 2 ], ou 

a est le plus petit des angles que fail, la ligne radiale avec toutes les 
lignes radiales du m£me plan cp = c, qui rencontrent l’image PRQ 
d’une courbe 2. 


Ainsi, nous dlterminons — et ^ comrae des fonctions continues 


de la position du point, qui ont la m£me valeur en deux points 
opposes dans les plans <p = o et f = 2it. La vitesse radiale est 
nulle pour /• = o et aussi pour les points du cylindre r — i qui se 
trouvent sur l’image d’une courbe 2, puisqu’alors a = o. 

Les courbes correspondantes dans I'espace primitif ont une 
direction de tangente qui varie d’une maniere continue partout 
except^ sur l. Par suite, elles reprdsentent les mouvements donnds 


par un syst£me d’£quations (i) pour n = 3, dans lesquelles x iy x 2 , 
x 3 sont les coordonn^es rectangulaires d’un point, et les fonc¬ 
tions X,, X 2 , X 3 sont continues, mais non analytiqucs. Les points 
singuliers S sont les points de/, auxquels X,, X 2 , X 3 sont discon¬ 
tinues, et il n’y a pas de points E. 

Tous les Inoncls donnas dans ce Mlmoire s’appliquent a un 
tel syst&me, pourvu que les conditions iniliales determinent le 
mouvcment d’une maniere unique, comme c’est ici le cas. 

Les courbes 2, Ividemment, donnent preeminent les mouve¬ 
ments ^currents discontinus pr^alablement construits. Pareille- 
ment, la courbe r = o correspond a un mouvement recurrent perio- 
dique. 

En outre, il ne peut y avoir d’autres mouvements recurrents, 
puisque le r de tout point, qui n’est pas sur r = o ou sur une 
courbe 2, croit constamment. En parliculier, il n’y a pas de mou¬ 
vements periodiques dans le voisinage infinitesimal d’un mouve¬ 
ment 2. 


( Extrait du Bulletin de la Societe mathematique de France, 
l. XL; 191a, p. 3 o 5 - 3 a 3 .) 
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PROOF OF POINCARE’S GEOMETRIC THEOREM 


BY 


GEORGE D. BIRKHOFF* 


In a paper recently published in the Rendiconti del Circolo 
Matematico di Palermo (vol. 33, 1912. pp. 375-407) and en¬ 
titled Sur un th&orbne de Giomitrie, PoiNCAR t enunciated a theorem of great 
importance, in particular for the restricted problem of three bodies; but. 
having only succeeded in treating a variety of special cases after long-con¬ 
tinued efforts, he gave out the theorem for the consideration of other mathe¬ 


maticians. j . . .. 

For some years I have been considering questions of a character similar to 
that presented by the theorem and it now turns out that methods which I 
have been using are here applicable. In the present paper I give the brief 
proof which I have obtained, but do not take up other results to which I 

have been led.t , . 

1. Statement of the Theorem. Poincare’s theorem may be stated in a 

simple form as follows: Let us suppose that a continuous one-to-one trans¬ 
formation T takes the ring R, formed by concentric circles C a and C b of radii 
a and b respectively (a > b > 0), into itself in such a way as to advance the 
points of C a in a positive sense, and the points of C b in the negative sense, and 
at the same time to preserve areas. Then there are at least two invariant 
pointsi 

In the proof of this theorem we shall use modified polar coordinates y - r 2 , 
x - 0 where r is the distance of the point (x, y) from the center of the circles, 
and 0 is the angle which a line from the center to (x, y) makes with a fixed 
line through the center. The transformation T may be written then 


x / - <p (x, y), y' = t (*, y). 


The function + (x, y) is a continuous function of (x, y), uniquely deter¬ 
mined at each point of R , and so is periodic in x of period 2 tt . The function 
<p(x,y) admits of an infinite number of determinations which differ from each 

• Presented to the Society, October 26, 1912. 

t Some of my results are contained in a paper entitled Quelques Ihtorhmts sur Us mouve- 
mtnts des sysUmes dynamiques, which is shortly to appear in the Bulletinde la Soci6t<$ 
Mathlmatique de France. 
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other by integral multiples of 2x, and these determinations can be grouped so 
as to form continuous branches. Since (x + 2ir, y) and (x, y) represent the 
same point of R, the algebraic difference between the values of one of these 
determinations taken for (x + 2ir, y) and (x, y) reduces to an integral mul¬ 
tiple of 2 t; and this difference must be one and the same multiple of 27r for 
all x and y because the difference is a continuous function. But if the point 
(x, y) makes a positive circuit of the circle C a , the same is true of its image 
(*'» y')\ hence along this path (x, y) increases by 2ir when x increases 
by 2tt . Thus the difference reduces identically to 2tt\ in other words, the 
function <p (x, y) increases by 2 ir when x is increased by 2 tt. 

In consequence of these properties of <p (x, y) and ^ (x, y) , it is clear that 
x' x and y' — y are both single-valued and continuous in R . 

The precise meaning of the theorem is that if any determination of <p (x, y) 
is made for which 

x' > x along Co and x' < x along C*, 

(the conditions on T make possible such a choice) then we shall have at least 
two points (x, y) of R for which 

y' « y. 

2. On the Method of Proof. As Poincar6 remarks (loc. cit., p. 377), the 
existence of one invariant point implies immediately the existence of a second 
invariant point. Hence if the theorem w false we may assume that there is no 
point invariant for T. In this case we shall have 

(1) (x / -x)*+(y , -y)*>d 2 > 0 

for all points of R since x' — x and y' — y are single-valued, continuous, and 
not simultaneously zero, over the ring R. We shall establish the theorem 
by proving that the hypothesis (1) admits a reductio ad absurdum. 

3. The Auxiliary Transformation. If 0 < t < b 7 , the one-to-one continuous 
transformation T, given by 

takes the circles C a and C b into the concentric circles C' a : y = a 2 — « and 
C' h :y = b 7 — € respectively, so that C a is within C fl at a distance a — Va 7 — t 
from C a , and C' h is similarly within C b at a distance b — Vb 7 — t from C b . 
This transformation effects a shrinking of the plane toward the origin which 
leaves every point on its radius vector and which preserves areas, since dx 
and dy are unaltered by the transformation and the integral of areas is 

Jfr dr dO = $ J j* dx dy . 
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Now, as long as « < A, the auxiliary transformation TT formed by fol¬ 
lowing T by T ., also has no invariant point. For if we lake x and y as the 
rectangular coordinates of a point in the strip S: 

- co < * < + “ , *>’<!*/? a 2 , 


corresponding to the ring ft, we see that the point (*. Jf) is displaced at least 
d units in S by T and then is further displaced a distance t by T, in the direc¬ 
tion of the negative y-axis, so that if € < d, the point cannot come back to its 
initial position. We write this compound transformation TT, in the form 

x' = <p(x, y), y' - *(x, y ) - «• 

Let us choose the positive quantity < once for all and so small that 

(2) « < b 7 , € < d , « < a* — 6*. 

Consider now the multiple-valued function 


(3) 


u (x, y) 


y' - y 

arc tanp^. 


or, more accurately, those branches of this function which give the angle that 
the vector drawn from the point (x, y) to the point (x', y') in the strip S 
makes with the positive direction of the x-axis (the other branches corre¬ 
sponding to the negative of this vector). In virtue of (1), this function is 
continuous at every point of 5 and accordingly falls into branches single-valued 
and continuous throughout S. 

Moreover any such branch reduces to a fixed even and odd multiple of 
t r along C a and Ct, respectively, since along C a we have x 1 > x, y' - y and 
along Ci>, x f < x, y' - y. 

The functions y # - y and x’ - x have been seen to be periodic in x of period 
2tt , and so any such branch of w(x, y) differs at (x + 2 t r, y) and (x, y) 
by a multiple of 2*■ which may reduce to zero. Since the branches are con¬ 
tinuous, this multiple is one and the same throughout S. But along C a and 
Cb these branches have a constant value, as we have noted. Hence the mul¬ 
tiple will in fact reduce to zero. Thus these branches of co (x, y) are periodic 
in x of period 2ir , i. e., are single-valued in R . 

Likewise it is clear that the multiple-valued function 


(4) 


y' — y 

«(*, y) = arc tan 5^T^ 


which gives the angle that the vector drawn from (x, y) to (x'„ y') makes 
with the positive direction of the x-axis, falls into single-valued and continous 
branches. Moreover the functions y' — y=y # —y—< and x'—x = x' —x are 
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periodic in x of period 2x; also each of the branches of Z> (x, y) is periodic in 
x along Co and along C b . We conclude thatthe branches of a>(x, y) as 
well as of o> (x, y) are periodic in x of period 2x, i. e., are single-valued in R. 

The branches of a > (x, y) and u (x, y) may be associated in pairs so that 
the maximum numerical difference of any pair for every ( x , y) is regulated 
in accordance with the formula 

( 5 ) I " (*» y ) - Zj(x, y) | < 

This is obvious geometrically, for the point (x', y') is at least d units distant 
from the point (x, y) in S, and (x', y‘) is distant e < d units from (*', y') 
so thatthe angle subtended at (x, y) by these last-named points can never 
become equal to \x . Thus if we associate the branches at a single point of S 
in accordance with (5), this inequality must continue to be true as the co¬ 
ordinates of the point vary continuously in S. 

4. Construction of a Curve Invariant for the Auxiliary Transformation. 
The transformation 7T, of R takes C a to the circle Cl of radius 
between C a and C b , since we chose « < a 7 - 6*. The repetition of 7T, takes 
C a into Co , a simple closed curve. Now 7T, is a one-to-one and continuous 
transformation of the ring R into the ring R ', limited by C ' m , Cl and obtained 
by letting R shrink under the transformation T ,. Since Cl lies within R and 
encloses C bt its image Cl' must lie within R' (and accordingly within Cl) and 
enclose Cl. Furthermore the transformation TT t takes the ring formed by 
C a C a in R into a second ring Cl Cl' in R ', which abuts on the inner boundary C' a 
of the first ring. If Cl' lies wholly in R it will enclose C b as well as Cl , 
and the image Cl of Cl by TT t will be a simple curve within Cl' and enclosing 
C* • Thus Cl Cl will form the boundary of a third ring abutting on the 
inner boundary of the second ring Cl Cl'. This process may be continued 
to form simple curves C«, Cl', • • • lying one within the other, and correspond¬ 
ing rings C a Co, Co Co , • • •, so long as the curves continue to lie wholly in 
R and not in part or wholly within C b . 

The area of the ring C a Cl is ir« and will, since the transformations T and T , 
each preserve areas, be the same as that of its image rings Cl Cl', Cl' Cl", • • •. 
This series of rings can only terminate when a curve Ca is reached (nj 2) 
part, at least, of which lies within the circle C b . But for every /, the area in¬ 
cluded between C a and Cl' 1 is xU, which if l is sufficiently large exceeds the 
urea of R. It follows that such a curve CV* exists. Hence we can find a 
particular point P of C a whose nth image lies inside of the circle C b . 

Now let us turn to the strip 5 and to one of the representations of P on the 
upper side C a of the strip. Here Cl is represented by a straight line at distance 
€ below C a , and Cl , Cl' , • • • are each represented by a simple open curve 
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congruent by sections 2fcr ? * ? 2(* + 1) * and extending indefinitely to 
right and left. The rings C.Cl, C.CL , ••• clearly are represented by the 
successive strata between successive curves of this set m S. Let f De I 
first image of P and join PP' by a straight line which will he entirely within 
the strip C a C' a . Let the arc ?' P" be the image of PP' under TT., and let 
P" P"' be the image of P' P" under the same transformation, and so on. In 
this way we construct successively PP', P'P",---, P ln ~ l) P M respec¬ 
tively in the strata C. <5, (5 CT . — . CT 11 CP in S, the end point P<"> of 
the last of these arcs falling below the lower side C b of S. 

Let Q be the first intersection of this succession of arcs with the lower side 



C b of the strip (see figure). It is obvious that the curve PQ formed by this 
succession of arcs is a simple curve, for PP 1 , P' P" , • • • are successive simple 
arcs which lie in the successive strata C a C' a , Cl Cl , • • • in S. Furthermore 
PQ lies wholly between C a and C b . 

The image P' Q' of PQ under the transformation TT, is made up of P' Q 
and an extended simple arc QQ' , the image of the arc Q~ l Q where Q~ l is 
the point that goes into Q by TT, . The arc QQ' being the image of points 
of R by TT, lies wholly below the straight line C'.\ the end-point Q' of this 
arc lies of course^. Furthermore QQ' has no point but Q in common with 
PQ. For if such a point exists it must lie on P'Q, and by performing the 
transformation inverse to TT, we see that Q~ l Q has a point other than Q~ l 
in common with PQ- 1 , which is not possible since PQ has no multiple points. 
Thus PQ' forms a simple curve. 

The transformation TT, takes the arc PQ of PQ' into the arc P' Q' of PQ ', 
advancing each point of PQ along PQ'. In this sense the curve PQ is in¬ 
variant -mder the transformation. 

The properties of T and T, ensure that P' has an x greater than that of P, 
and that Q' has an x less than that of Q , as indicated in the figure. 

5. The Rotation of the Auxiliary Point-Image Vector on the Invariant 
Curve. If a point D moves along PQ' from P to Q , it is clear that its image B' 
by TT, moves from P' to Q' along the same curve, never coinciding with B. 
We shall now establish the fact (intuitively almost self-evident) that the 
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corresponding rotation of the vector BB' thus obtained is — 7 r plus the sum 
of the two acute angles which the straight lines PP' and QQ' make with the 
ar-axis. 

At the outset it is obvious that the rotation can differ from this value only 
by a multiple of 2 t. That the rotation has precisely the value stated 
depends entirely on simple considerations of analysis situs. The fact on which 
this conclusion rests is that a continuous deformation of the curve PP' QQ' 
through a series of simple curves containing P , P ', Q , Q' brings the curve to 
the broken line position PP' QQ' (see figure). 

Let t be any monotonic parameter for the curve PQ taking on the increasing 
values t 0 , to, t lf t'^nt P, P', Q, Q' respectively. Let r (<) be the value of 
this parameter for W where t is its value for the corresponding point B . Clearly 
r (t) is a continuous increasing function of t (t 0 ? t ^ t x ) which has the 
property r (t) > t . Consider now any varied simple curve through the same 
four points P, P' ,Q,Q' on which a monotonic parameter t is so chosen that as 
before t 0 , t'o,ti, t\ correspond to P , P ', Q , Q' respectively.* If the distance 
between any point of the varied curve and the point of PQ with the same 
parameter value is uniformly small, the corresponding vector BB' along the 
varied curve will undergo precisely the same total rotation as along PQ, 
since the initial and final positions are the same in either case and the 
angular differences of the vectors in all intermediate positions are uniformly 
small also. 

As a consequence of this reasoning it follows that we may deform the curve 
PP' QQ' continuously through any series of simple curves containing the same 
four points P, P', Q, Q' provided that for the varied curves the parameter 
is properly chosen, and the total rotation of BB' will not thereby be altered. 

The curve PQ lies wholly in S, so that the straight line QQ', being out¬ 
side of S, does not intersect the curve PQ . It is therefore apparent that 
the arc QQ' may be continuously deformed to this straight line position QQ' 
without taking the curve QQ' outside of the strip formed by C« and Ci; for the 
continuum formed by this strip is simply-connected after a cut in it by the 
curve P'Q is made, and hence any simple curve joining the points Q and Q' 
of this continuum, and lying in it, can be continuously deformed into any 
other such simple curve through a series of simple arcs not having any 
points but Q and Q' in common with the boundary of the cut strip. 

Next, the arc P' Q may be continously deformed, through a series of simple 
arcs joining P' to Q in the strip C« C b in which P' Q wholly lies, into the 
straight line PQ, inasmuch as the straight lines QQ' and PP' have no points 
within this strip. 

• I include in the term “ varied simple curve ” any simple curves with the same geometric 
locus but with a different parameter. 
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Hence the rotation of BB' during its first series of positions is precisely the 
same as it is along the broken line formed by the three straight segments 
PP' P ' Q, QQ' ■ The rotation along the broken line from the initial to the 
final position is clearly - tt plus the sum of the acute angles which the vectors 
pp> an d QQ' make with the rr-axis. Thus our statement is proved. 

In the first series of positions, B' is obtained from B by the transformation 
TT ,. If the coordinates of B are (*, y ), those of B' are accordingly (*', y'), 
and the rotation of BB' is measured by the change in the Junction u (x, y) as 
(x,y) moves from P to Q along the invariant^ curve PQ- 

Let us fix upon that continuous branch u x (x, y) of this function which is 
measured at the point P by the negative of the acute angle which the vector 
PP' makes with the r-axis. At the point Q this determination will have the 
value — 7 r plus the acute angle which the vector QQ' makes with the avaxis. 

6 . The Rotation of the Point-Image Vector for T. Now fix upon that 
continuous branch w, (x, y) of the function to (x,y) which, along the upper 
side Co of S , takes on the value zero. 

These two functions «, (x, y) and o>, (x, y) differ by less than \ tt at the point 
P and hence are branches of w ( x , y) and « ( x , y) associated by the inequality 
(5), which holds throughout S. Moreover the term ini value of the function 
wi (*» y) has been shown to differ from — rr by less than \rc at Q, so that 
w x (x, y) differs from - rr by less than t at Q. Any branch of the function 
a )(x, y) however has been seen to be precisely equal to a fixed odd multiple 
of t r along C h . Hence this function u>, (x, y) must have the value - tt at 
Q and also at all points of C*. 

The variation of o> x (x, y) is therefore - ir when the point (x,y) moves in 
any manner whatever from a point of C a to a point of Ct>. In other words 
if we let the point B move in any manner from C n to Ct in S, and let B' denote 
the image of B by the transformation T, the total rotation in the rector BB' will 
be precisely — it . 

7. Completion of the Proof. Consider now the transformation T~ l inverse 
to T, which is in every respect similar to T except that points on C„ and Ct, , 
and hence on the two sides of the strip are moved in the reverse direction. 
By symmetry, the vector B'B which joins B' to its image B under T 1 must 
now rotate through an angle + rc as B and B' move from C u to Cb . 

Here the vector B'B is of sense opposite to that of BB'. But the actual 
rotation of the vectors BB' and B'B is of course one and the same quantity, so 
that a contradiction has been reached. Hence the theorem is completely 
proved. 

8 . Generalizations of the Theorem. In phrasing his geometric theorem Poin¬ 
care makes the hypothesis that we have some integral invariant ff P( x,y) dxdy 
(P (x, y) > 0), not necessarily the invariant of areas. However, we can, 
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by a suitable change of coordinates from ( x,y) to ({,77) , change the 
integral invariant to the simpler area invariant. 

Let the lines 77 = const, be the circles concentric with C a and Cb, and 
let the number 17 (y) be so chosen for each circle that as a point moves from 
Ct to C a the double integral taken over the ring between Cb and the concentric 
circle through the moving point is equal to 77: 

v = XIX p I dy ' 

This function 77 (y) is clearly continuous and increasing and has a positive 
continuous derivative, namely 

^r = /* i)dx - 

0 

Speaking somewhat inexactly, the 77-curves are all so placed as to measure 
off equal increments of ff P (x, y) dxdy for equal increments of 77. Now 
(speaking in the same inexact sense) choose the ^-curves so as to measure off 
equal increments of the same double integral between these successive 77-curves 
and the line x = 0. 

The possibility of making such a choice of curves £ = const, may be seen 
as follows: Let x = f (y) be any curve lying in R, joining C a to Cb, and such 
that the part of the area between the initial line x = 0, this curve, the circle 
Cb and the circle y = const, is constantly proportional to 77 ( y): 

XIX *<*■*> <**}<*»-£»(«'>• 

In view of the above mentioned properties of 17 (y) , this equation is equivalent 

<«) jf PC.,)*-*4^12. 

For a given k, the quantity / (y ) is clearly a single-valued continuous function 
of y since P (x, y) and drj (y)/ dy are positive and continuous. Moreover 
an increase in k continuously increases /(y). Thus, we get a set of non- 
intersecting curves x = f (y) which, like the circles 77 = const., fill up the 
ring R. It is necessary to notice that x = 0 corresponds to k = 0, and that 
x = 2 tt is the curve for k = 2 tt by definition of 77 (y ). We take k as deter¬ 
mined by the above method to be the coordinate £ of any point on the curve 
* =/(y). 

Thus we have coordinates (£, 77) for which the integral invariant is the 
invariant ffd^drj. Taking (£, 77) as the modified polar coordinates of a point 
in a new plane, we find that, expressed in these coordinates, T has all the 
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properties specified for T in the theorem. Thus we infer that there are at 
least two invariant points as before. 

A further generalization is that the curves C a and C b may be allowed to be 
any simply closed curves, one within the other, bounding a ring R\ we may 
again state a similar theorem, for we can make a preliminary transformation, 
for instance a conformal one, to take these curves into concentric circles, when 
of course the integral invariant merely changes form. Here it may be necessary 
to consider with care the nature of the integral invariant near the boundaries 
after the transformation of coordinates. 

Finally we may permit the function P (x,y) to vanish at some or all points 
of the curves C a and C b . Under certain restrictions it is certain that in this 
limiting case there will be invariant points also. 

9. Poincare’s Method. It is interesting to notice that Poincar6 uses only 
a single property consequent on the existence of an invariant integral, namely 
that no continuum on R can be transformed into part of itself by the transfor¬ 
mation T (loc. cit.. p. 377). It seems improbable that this condition is equiva¬ 
lent to the condition that there exists an invariant integral. The existence of 
such an invariant integral is a fact which enters more intimately into the proof 
which I have given above. I do not know whether the modification of 
Poincare’s theorem which results when the condition that an integral invari¬ 
ant exists is replaced by this weaker condition is true. 

Harvard University, 

October 20, 1912 
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By George 0. Birkhoff (Cambridge, Mass.). 


F.tlM'to J.l lomo \X\IX (i® um. 191 0 4ci MmOUnmU d*l Clrroto MaUmatUo .11 I’Olrrmo. 

Adsoiaza l«l *| tgotio iyi«. 


Thorough investigation of non-integrable dynamical problems is essential for the 
further progress of dynamics. Up to the present time only the periodic movements 
and certain closely allied movements have been treated with any degree of success in 
such problems, but the final goal of dynamics embraces the characterization of all 
types of movement, and of their interrelation. 

The so-called restricted problem of three bodies, in which a particle of zero mass 
moves subject to the attraction of two other bodies of positive mass rotating in circles 
about their centre of gravity, affords a typical and important example of a non-intc- 
grablc dynamical system. It is this problem which I consider in the present paper. 

Part I (§§ 1 - 5 ) deals with the differential equations of the problem. 

As ordinarily given, the equations of motion ($ i) admit of three singular points 
corresponding to the collision of the particle with either body and to its rejection to 
infinity. Ii was the notable achievement of LeviCivita ') to have discovered that the 


') T. LlYI-ClVlTA. Sur la resolution qualitative du problem/ resheint del troil corps |Acta M.ithc- 
nutica, t. XXX ( 1906 ). pp. }OS ) 27 |. See also an earlier paper by the same author. Traiettorie singolari 
cd urti ml problem x r is tret to dei tre corps (Annali di Matematica. ser. HI. voL IX ( 1904 ), pp. 

(Note addeJ January p, 1914 . As I have been kindly apprised by Professor Levi Civita, he has 
found marc recently that a regularization at collision with both bodies had been given by T. N. Thiele, 
Reciter dies numerous concemant des solutions periodiques d’un cas special du problem/ d/s trois corps 
(Troisifcnte mi moire) l Astronomische Nachrichten, vol. CXXXVIH ( 189 s). pp. mo], who. however, 
deduced no theoretical consequences therefrom. This regularization, less simple then that of Levi-Civita 
and also falling under the type treated by me in $ a, » entirely distinct from the algebraic regulariza¬ 
tion given in $ $, and used later in the paper. 

Dr. G. M. Green and I have both noted that all of these forms of the equations of motion 
become canonical, for a given value C of the constant of Jacobi, if the variables of the first set arc 
11 , t* ($ 2 ). and of the conjugate set are 11 , v, given bv 

" = (if ~ ’) ’■ + (if + ‘) 7 “ (tt ~ ’) *-+ (if+*) • 

R,*J. c.u. IUlfm*. 1. XXXIX (1® MM. 191s). —Su»r«o 11 9 ■UJT® i- 
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equations of motion admit of regularization so far as collision with a given one of the 
bodies is concerned. By the aid of this regularization it became possible to determine 
the character of the motion near collision without difficulty. 

On the basis of a new transformation of variables (§ 2) applicable to a consider¬ 
able range of dynamical problems, I obtain the well-known equations of normal and 
tangential displacement (§ j), the equations of Levi-Civita although not in canonical 
form (§ 4), and a new form of the equations of motion in which the only singularity 
corresponds to the rejection of the particle to infinity (§ 5). 

In Part II (§§ 6-8) the nature of the manifolds of states of motion is considered; 
by a state of motion I mean the position of the particle together with its vector velo¬ 
city at an instant. 

The new form of the equations affords a non-singular representation of the man¬ 
ifolds of states of motion for any value of the constant of Jacobi, and allows one to 
represent the movements under consideration as the stream lines of a three-dimensional 
flow in non-singular steady motion (5 6). 

In this way a characterization of the states of motion in the six possible cases 
may be made without difficulty ($§ 7 » 8). For example, it is found that when the 
particle is confined to an oval of zero velocity about one of the bodies, the manifold 
is in one-to-one continuous correspondence with the points inside of and on a sphere, 
under the proviso that diametrically opposite points are counted as identical; under 
the same proviso, it turns out that, when the oval of zero velocity includes both 
bodies, the manifold of states of motion is in one-to-one continuous correspondence 
with the points between and on two concentric spheres. 

The case of the problem in which one of the two bodies is of mass zero admits 
of explicit solution. PoiwCAtfe *) generalized from this limiting case and was led to 
a representation in the space of inversion as long as the particle was confined to an 
oval of zero velocity containing one of the bodies. It was not seen by him however 
that this representation is two-to-onc, a fact in accord with the one-to-one representation 
bv a sphere stated above 3 ). 

The further results of the present paper are concerned only with the case in 
which the particle is confined to such an oval. 

The same generalization led Poivcark to the fundamental concept that the re¬ 
stricted problem of three bodies admits of reduction to the transformation of a ring 


while the principal function is 

(t[( t *) + (vf) ] ~ [° (I - '> - -r c ]) • 

and the independent variable is t]. 

a ) H. PoincarC. Let m/ihodei no uv tiles de la MteanLfue Celeste, voL III (Pam. Gauthier-Yil- 
lars, 1899), pp. 196-200, 

a) H. PoincarE, Sur un tbeoreme Je Geometrie fUcn.li:onti del Circolo Matcnvtlico di Palermo, 
t. XXXIII (i° semestre 1912), pp. ) 7 $- 407 |. 
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into itself. Part III (§§ 915) of the present paper is devoted to the further investigation 
of this concept. 

I commence with a consideration of the case in which one of the bodies is of 
mass zero, and arrive at a geometrical representation of the ring in this case (§§ 9, 10). 

In order to generalize this interpretation certain properties of the family of direct 
and retrograde periodic orbits are established ($$ 11, 12). As the constant of Jacobi 
increases from the given value to infinity a continuous ring of such periodic orbits is 
obtained. The states of motion belonging to the ring for the given value of this con¬ 
stant arc the states of positive tangcncy to these auxiliary orbits. If a particle be projected 
from any point A' iu the plane of the motion tangent to the auxiliary orbit through 
the point, the particle will become tangent again to other such orbits at points I, 
M , ... infinitely often. The transformation of the ring is defined as that which takes 
K into the first such point L (§ 13). 

In this manner a complete representation of the ring transformation in the plane 
of the motion becomes possible. The ring employed may of course also be thought 
of as situated in the space representing the manifold of states of motion. It differs from 
the ring obtained by PoiNCARfe merely in its simple and more specific character. 

A fundamental property of this and similar transformations, noted by Poincar 6 , 
is that an area integral is invariant. A further property, which I have found to hold 
in the restricted problem of three bodies, is that the transformation is the product of 
two involutory transformations ($ 14). The latter property is the important one for 
the present paper. 

The construction of a ring outlined above is valid only for suitably restricted 
values of the masses. A necessary and sufficient condition for the existence of a ring 

can be found on the hypothesis that direct and retrograde orbits of simple type, cor¬ 

responding to the two boundaries of the ring, exist ($ 15). 

Part IV (§§ 16-21) is given over to the consideration of the periodic orbits. Later 
I expect to publish the results which I have obtained concerning the general orbit. 

The question of the existence of periodic orbits is of great importance. The nota¬ 
ble work of Hill *) and Darwin 4 5 ) leaves no practical doubt that direct periodic or¬ 

bits of simple type continue to exist through wide ranges of values of the masses and 
of the constant of Jacobi, Moreover, Moulton w ) has observed that retrograde orbits 
cannot vary through a cusped form under certain conditions, and so seem to have a 
peculiar quality of permanence. His interesting conjectures concerning these orbits have 
been borne out by later computations made under his direction. 


4 ) G. W. Hill. Researches in Iht Lunar Theory [American Journal of Mathematics, vol. 1(1878), 

pp. 5-26. 129-147. 245-26oJ. 

5) G. H. Darwin, Periodic Orbits [Acta Mathematica. vol. XXI (1897), pp. 99-242!. 

*) F. R. Moulton, Relations among Families 0/ Periodic Orbits in the Restricted Problem of Three 
Bodies | Proceedings ol the filth International Congress ol Mathematicians, Cambridge, 1913, vol. II, 

pp. 18.’-187 J. 
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Notwithstanding these facts, however, the existence of these direct and retrograde 
orbits has only been rigorously established for sufficiently small values of one of the 
masses, or for a sufficiently large value of the constant of J\cobi 7 ). 

By a simple method I have been able to demonstrate that, as long as the par¬ 
ticle is confined to an oval of zero velocity about one of the bodies, there necessarily 
exists at least one retrograde orbit of simple type (§§ 16, 17, 18). 

The direct orbits do not appear to be amenable to a like attack. I have, however, 
been able to establish that, for sufficiently small values of one of the masses, the ring 
transformation of PoincarE may be replaced by the transformation of a discoid into 
itself whose only boundary corresponds to the retrograde orbit, and that, as long as 
this construction is possible, there will necessarily be one direct orbit of simple type 

The balance of evidence seems to render it probable that this new form of trans¬ 
formation continues to exist as long as there is an oval of zero velocity about each of 
the two masses. The ring transformation, on the contrary, necessarily breaks down 
when the direct orbit becomes unstable. 

In a last paper PoincarE 3 ) established, that, if a certain geometric theorem were 
true, the existence of a ring transformation carried with it the existence of an infinite 
number of periodic solutions of the restricted problem of three bodies. This geometric 
theorem was later proved by me *). The property of the ring used is that an area 
integral remains invariant under the transformation. Since the earlier work of Poin¬ 
care had only established the existence of an arbitrarily large number of periodic so¬ 
lutions for sufficiently small values of one of the masses, this conclusion constituted a 
definite step in advance, made possible only by the reduction of the restricted problem 
to the transformation of a ring into itself. 

It was stated above that this transformation is the product of two involutory 
transformations. The existence of an infinite number of symmetric periodic orbits flows 
intuitively from this fact; by its means also very definite conclusions concerning the 
characteristic properties of these orbits, and concerning their distribution in the plane 
of the motion, may be reached (§ 20). 

The property used by PoincarE enters, however, into the proof of certain geome¬ 
tric facts (§ 21). For example, as long as the ring exists, it may be proved that, for 
the given value of the constant of Jacobi and a given point in the plane, an initial 
direction of projection may be chosen in an infinite variety of ways so that the par¬ 
ticle passes through the initial point at a later time. 


7) A criterion for the existence of periodic orbits has been given by E. T. Whittaker, On Periodic 
Orbits in the Restricted Problem of Three Bodies [Monthly Notices of the Royal Astronomical Society, 
vol. LXII (19011902), pp. 546-Jj2j. 

a ) G. D. BiRKHOFF, Proof of PoiNCARt'r Geometric Theorem [Transactions of the American 
Mathematical Society, vol. XIV (191))* PP* < 4 * 221 . Demonstration du dernier Theorems de Geemelrie de 
PomCABfi [Bulletin de la Social* Mathdmatique de France, vol. XLII (1914). PP- M»J (translation). 
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Part I. 

THE DIFFERENTIAL EQUATIONS OF THE PROBLEM. 


s >• 

The Usual Form of the Equations of Motion. 

The restricted problem of three bodies in its customary form deals with the rela¬ 
tive motion of a particle P of zero mass lying in one and the same plane with two 
bodies S and J, which attract each other and P according to the Newtonian law, and 
rotate positively in circles about their center of gravity O. By a specialization of units 
the masses of S and / become p and i — p. respectively, while the distance from S 
to /, the angular velocity of S and /, and the gravitational constant, reduce to unity. 

If the rectangular coordinates of P relative to rotating axes, taken along the line 
SJ and perpendicular to it at 0, arc x and y respectively (fig. i), and if I indicates 



the time, the differential equations of motion arc 

(') r£-*5 = “■<*• *>- + *37 = u > (x ’ 

where 

I = V(x - py +7, r, = 1'(*+>-(*)■+/• 

In accordance with the notation for partial derivatives employed throughout the 
present paper, 12 ,(x, y) and ii,(x, y) denote the partial derivatives of ll(x, y) as to 
x and y respectively. 

Bv multiplying the first and second equations (i) by 2 dx/dt and 2 dy/dt respec- 
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lively adding and integrating, we obtain the integral of Jacobi 

(sy-tsy+tay—•'*-« 

where s measures the arc length along the orbit of P , and C is the constant of Jacobi. 

In what follows we consider invariably the totality of motions of P for which the 
onstant C has a given value. This specialization amounts to a use of the integral of 
Jacobi to reduce the system of deferential equations (i) from the fourth order to the 

,h,rd A simple way of accomplishing this reduction depends on the introduction of the 

new variable ^ 

, N o = arc tan . J 

(4) ? dx 

representing the angle between the direction of motion at any instant and the direc¬ 
tion of the positive x-axis. We obtain at once from (1) and (3) 

y) — C cos ? = X(x, y, ?), 
a t 


(5) 


d d > = V2ll(x, y) — C sin? = K(x, y , ?), 

d? , , (*.>)«n ? + U/x,y)cos? 

~Ai = - 2 +-r 


?)• 


V2U(x, y) -C 

The first two equations result from the equations dx/ds = cos?, dy/ds — sin?, and 
from (3). The last equation may be deduced by forming dy/di, and eliminating dx/dl } 
dy/dt, d‘xjdt\ d‘ y/d t‘ by the aid of the first two equations (5) and the equations 
(1). The equations (5) arc equivalent to the equations (1) joined with the integral 
(3), and form a system of the third order. 

If we observe that the right-hand members of the equations (5) do not involve 
the variable /, it becomes evident that this variable may be eliminated and a reduction 
to the second order effected. The resulting equations may be written 


d x _ _ dy _ dy 

X{x t y, ?) — K(x, y, ?) d»(x, y, ?) 

Therefore, if x, y, ? be taken as the rectangular coordinates of a point in space, 
the totality of curves of motion under consideration will be represented by a family of 
curves, one passing through each point of the cylindrical region U (x, y)--C^o to 
which the motion is necessarily confined. 

This representation is singular only when the velocity is zero or infinite. The 
first of these two possibilities takes place when P is on the curve of zero velocity 
ii(x, y ) — iC = o, and docs not correspond to any singularity of the motion. The 
second takes place when P collides with 5 or /. 

As time varies, each point of x, y, ?-spacc moves with a velocity, dependent on 


687 




THE RESTRICTED PROBLEM OP THREE BODIES. 


7 


position only, along its curve of motion. Thus the totality of motions will correspond 
precisely to the steady motion of a fluid, which is incompressible in virtue of the 
identity 

(6) X,(x, y, ?) + P,(x, y, ?) + •»*,(*, y, ?) = o. 

Although this representation gives us some idea of the nature of the problem, it 
is possible to eliminate entirely the singularities of the representation by a suitable 
choice of variables. Such an elimination will be effected in § j. 


S 


On the Transformation of Certain Differential Equations. 


Let us turn for a moment to the consideration of the type of differential system 


O') 


*j, - 2Hx, ,)% = U.(*, J-), 


77 + **(*. y)^ = u,(*. y). 


which constitutes a generalization of (1). We shall regard t as the time, and x, y as 
the coordinates of a moving particle in the x, y-plane. 

We consider the triply infinite set of solutions of this system for which 


O') 



= 2 l/(x # y), 


where s measures length of arc. If now wc introduce the variable 9 defined as in 
(4) wc obtain without difficulty 


( 5 ') 


Yi = ^ 2 U (x, y) cos 9, 
d } t = fTU(x, jr) sin 9, 

dt .... — u .(*> y) sin ?+ u ,( x < y)“*t 

-* = -, H x,y)+ - y2(J(Xi - yf ----- 


The curvature K of the orbit is dy/d *, which may be expressed in terms of x, 
y, 9 by the aid of (3') and the last one of the equations (5'): 

„ — 2*(x, y ) , — u *( x * *)*“* + y) cos ? 

72V&; yj* 2U(x t y) 

This gives an intrinsic formulation of the characteristic geometrical property of the 
curves of motion. 

Let n be the distance measured along the normal to the curve of motion at (x, y) 
so chosen that the directions along which s and n increase are oriented like the pos¬ 
itive x- and y-axes respectively. Since 9 denotes the angle between the instantaneous 
direction of motion and the positive x-axis, the angle between the instantaneous normal 
and the positive x-axis is 9 + \ Thus the normal derivative of any function /(x, y) 
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is -/,(x, y)sn T +/,(*, y)cos?. Bearing this in mind we may formulate the fol- 

lowing conclusion: . 

The totality of carve, * = x(l), y = y (0 defined by the equatwn, (i ), (3 ) 
same as that defined by the equation 


( 7 ) 


d " can ^ = _ L+J-»logy ( x, y>, 

77 /TU(x, y) T 


where , measures the arc of the carve and n i, the distance measured along the inslantan■ 
eous normal. 

To complete a proof of this statement we need merely to show that we may 
pass conversely from (7) to (i'), 0 ’)- But if we write 


? = arc.an£, t = fj=^, 


we deduce (;') at once from (7), and thence (i')» 00- 

Let us now consider the result of making a transformation_from the x, y to_the 

u, v-plane defined by an equation { =/(«') where \ = * 4 " /~“ 1 lV} 
and where f(tv) is an analytic function of w. 

To effect this transformation of the totality of curves (l')» ( 3 ') wc first nolc lhc 

relation _ _ 

d x + f^ldy =f(w){du -f- /— 1 

in which f(w) denotes the derivative of f(w). From this follows 

arc tan ^ = arc tan ^ -f ar g/' ( w )» 

and also 

ds = \f(w)\ds t , dn = \f'(w)\dn i} 

where s , and n, arc measured along the transformed curve and along its instantaneous 
normal respectively. Here wc make use of familiar properties of a conformal transfor¬ 
mation * = /(u/). 

Furthermore, inasmuch as log|/(w)| and arg f (w) are conjugate harmonic func¬ 
tions, we have 

£ arg/-(<*) = — ^jlogimi 

by the Cauchy-Riemanm differential equations. 

Making use of these relations we obtain from (7) 


" C ' ,n ru - _a[»(x, ,)\ row] , J_ a fl 

d‘. Yi U(x, y)l/’(u-)|‘ J 


This is the intrinsic equation of the transformed curves, and 
(7), with x, y, X(x, y), and ( 7 (x, y) replaced by u, v, l(x, 
respectively. 


y)i n*n 

is precisely of the form 

y)\f ( w )\‘> and u ( x > y) 
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Accordingly, by the italicized principle above, the curves of motion, referred to 
the coordinates u and v, satisfy the equations 


( 9 ) 


- 2 x(x, ainuor^ = ["(*. rtiz-wj., 

j )dir wri. 


Moreover, since ds = \f(u/)\ds lt we infer at once that 

dt 

By the transformation of variables 


x+ /=!■, =f(u + f=r,v\ d, = \nw)\‘dr, 

the totality of curves x = x(t), y = y(t) defined by the equations (I')» (3*) go" into 
the totality of curves defined by a similar set of equations (9) obtained from (1'), (3') 
by replacing x, y, /, X(x, y), U(x, y) by «, v, r, X(x, y)\f(w)\\ >)l/'WI* 

respectively. 


5 3 - 


The Equations of Normal and Tangential Displacement. 


As a first application of this method of transformation, I shall derive the equa¬ 
tions of normal and tangential displacement for the restricted problem of three bodies ®). 
These equations will be useful later in the paper. 

Let x(t), y(t) be a solution of (1), (3), and let us make a conformal transfor¬ 
mation of the x, y-plane which takes the corresponding orbit into the real axis in 
the transformed plane in such a way as to preserve arc lengths along the orbit. This 
transformation has the explicit form 

; = /(«-) = /. 0*0 + V^if, («■), («(0 = /, ( 0 . y ( 0 =/. ( 0 )- 

Owing to the fact that the transformation is conformal, normal derivatives along the 
given orbit arc unaltered by it. For this reason we will write w = s ^— 1 n. 

The application of the method of transformation to equations (1). (3), for which 
>(x, y) = 1 and £/(x, y) = U(x, y) - ^ C, with the above choice of /(a-), gives 


®) Cf. Darwin, loc. cit. *), pp. 133-141. 

Kind. Ci’/. SUlrm. Pahimr, I. X\\|\ (,o M(n . iflf). — SlmpiM il 9 1911. 1 


690 



GEORGE D. B1RKHOFF. 


the equations 

& - 2\rm£ = f.o. »). + >\rw'A = ^ ">• 

^y+(gy-^->. 

Wher * F(*, n) = [U(x, >) - i C] \T (u')I'- 

Along the curve of motion wc have 


I/’(“01 


= l?-i = i 

dw 


so that dt = dr along this curve. Moreover this curve corresponds to the solution 

s = s 0 (r), tt = n o = o 

of the transformed equations. 

Substituting in these values for \f(w% r, r, w, we obtain 

d ir = 


from which wc deduce the condition 

7 f . 0.» °) = iCTKTSi. 

Suppose now that wc consider a varied solution f = -f cXj, n = c&m where 
c is a constant which is to approach zero. The limiting forms of as t ap¬ 

proaches zero arc obtained as the solutions of the equations 

. 

¥*FO .:°)tV = «)*« + M'aF(«., o)*.., 

deduced by the usual method of variation. Here wc have made use of the equations 
noted above and also of the equation 

F (s , o)= ?5foj.°2 

'• U ’ J I'aFfc.o) 

derived directly from them. 

The first of the three equations for Sj, $/i may be obtained from the last by 
differentiation. This is to be expected in consequence of the fact that (i) and (3) arc 
not independent. 

By combination of the second and third equations we can eliminate &j. Thus 
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there results the final form of the equations of displacement 
d'Sn 


(io) 


^ + [4 + 2V2 F(* # , 0 )^ 1 /(Ol’ - F„.(s ol o)]*n = o, 

*h- fX*.. °) 

77 P*P(*„ o) 


n. 


If we agree to let /?(p) denote the real part of p, and to let 7 (p) denote the 
coefficient of l/— 1 in p, the functions /(u») and F(s, n) which enter are given explic¬ 
itly by the equations 


(") fo , »)=[“(*(/(«')]. '[/(«-)])-7 C]l/'(«OT. 

where I is the function of w = s - f- f'— 1 n such that 


“•> -=f t/wr+tefh 

Moreover is the arc length taken along the orbit. 

I lie first and second equations (10) may be termed the differential equations of 
normal and tangential displacement respectively along the orbit x = x(/), y =y(f). 
Every slightly varied orbit yields a normal and tangential displacement which are re¬ 
presented to terms of higher order by c$n, cSj where e is an infinitesimal; and con¬ 
versely. 

The precise relation of the equation of normal displacement to the orbits is es¬ 
sentially contained in the usual theorems of the Calculus of Variations. In fact it is 
well-known that the parametric problem of minimizing the integral 

leads at once to a set of extremals which arc precisely the orbits in the restricted 
problem.of three bodies for the given C. This becomes immediately obvious if the 
parameter t is so chosen that the integral relation (3) holds. The equation of normal 
displacement is the differential equation of Jacobi for this Calculus of Variations problem. 


§ 4- 

Levi*Civita’s Form of the Equations of Motion. 

The form of (1), (3) due to Levi-Civita results from the following transforma¬ 
tion of variables 

03) x — |i = />’ — ?’, y = 2pq , <// = 4 (p’ + f*)dT. 

This may be written 

i =K w ) = + u'N dt = z*(«orrfT 
where 1 = x -f- }/— 1 y, tv = p -f- \' — 1 q, and falls under the type of transformation 
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ia 

considered in $ 2. If applied to (i), (3) we obtain at once essentially the equations 
of Lbvi-Civita 

(0-8 ( P‘ + rtjf = [■*(“(/■■ - *Ff)- T C ) (P ‘ + ?,) ]/ 

(m) |j£ + »(/>* + f*) - ?'■ */*) - t c ) V + *■>], ■ 

in accordance with the results of § 2. 

By this transformation the point J goes into the origin in the />, tf-plane, which 
is not a singular point. In fact the term (i — jt)/r, or (i — u)/(/>‘ q‘) in 12, which 

becomes infinite, appears multiplied by a factor (p* -f- ?‘). 

At p = q = o the third equation (14) shows that the square of the velocity is 
8( x __ jx) for the possible orbits. Hence the curves of motion are analytic curves 
without singularity near the origin. 

In consequence of these facts it is apparent that the orbits approach a cusped form 
near collision with 5, and likewise with J '). 

The singularities of the transformed equations lie at 00 and at the two images of 
the point 5 in the p , f-plane. 


S 5- 


A New Form of the Equations of Motion. 


In the present paragraph we shall use a similar transformation to remove the 
singularities at both S and J [see '), Note]. The resultant form of equations yields 
essential aid in Part II, and is undoubtedly of definite theoretical interest. Nevertheless, 
in Parts III and IV of the present paper, the simpler equations of Levi-Civita turn 
out to be more convenient. 

In order to completely regularize the equations of motion, we proceed to effect 
the transformation of (1), (3) corresponding to 


(15) 


<=/(") 


-f pO — p) 

2 XV -j- I — 2 |A 


The transformation from ^ to w admits of the alternative form 


From the equation (15) we find at once 

, N (w — ny , (u/ +1 — 
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and also 

(. 8) /(«,)= £> . 

V y ' (2tt» -f I — *P) 

The explicit transformation from x, y , I to u, v, t is (S 2) 


(» 9 ) 




r = 7 «-T 


(»+T-p)‘ + «''’ 


= k— py+viK*+»-py-+«v 
4[(-+r- 


The resultant form of equations is then 

d‘u 
d 


(20) 

where 


i 7? g= a >- •* j ?+t ■ P f £ =u:(B> 

(«)’+ (£)’ == -•(->• 

(20 u . ( ,o = ^[^±^ + 2 P> (^ + L^)-4-c], 

p, = >'(«—p) , +®\ p, = /(“+■ - k)‘+ v, p,=^(“+7-i*)‘-f 

To obtain these equations we apply the method of § 2 for >(x, y) = i and 
U(x, y) = li(x, y) —±C; we need only express \f(w)\ and U(x, >) —±C in 
terms of the new variables w, v, and substitute these values together with X(x, y)=i 
in equation (9). By (18) we find 

LT( u ')l = • 

Also by (17) we find 1 

'. = k — H = -£. r. = k+i—= 

From (2) and these last equations we derive 


“('■»)— T c - T + -.(^ + i) - T^ 

The points = p and w = — 1 -f- p correspond to the points { = p and 
l = — 1 + p respectively. Now 12 *(«, v) is analytic at these two points in the u, 
v-plane. The coefficients of du/dr and dv/dz in the equations (20) are also analytic. 
Thus the new equations have an ordinary point at (p, o) and (— 1 -f- p, o) in the 
u, v-plane corresponding to (p, o) and (- 1 + p, o) in the x, y-plane. The nature 
of the solutions at these points is given by the customary fundamental existence theo¬ 
rems for ordinary differential equations. 

The same examination shows that the only finite singular point of the new equa- 
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.ions is at (- t + V, o) when p, is zero. This point to = - 7 + H, and also w- co, 

correspond to X. = . ...... 

The equations (20) remain regular so long as tlx particle P is not rejected to inan¬ 
ity in the x, y-plane. . , 

The importance of the new form of equations consists in the fact that «, v, 
u' = du/dr, v’ = dv/d t form an adequate set of coordinates of the motion under 
all circumstances. Any set of values v, v, u', v' satisfying the relation 

(22) „'« + **«- 2fl«(M) = 0 

will be termed a slate of motion. 

For each state of motion we have a unique set of values x, y, dx/dt, dy/dt 
satisfying (3) provided that the point (x, y) does not coincide with S or J. In these 
exceptional cases it has been noted (§ 4) that the motion is represented by a cuspcd 
orbit in the x, y-plane with cusp at S or /. To each u\ v' corresponds a possible di¬ 
rection of the axis of the cusp. 

Wc shall agree to refer the state of motion to the x, y-plane in which the par¬ 
ticle moves, and wc shall proceed to establish that for a given state of motion there 
exist always two and only two distinct sets of values u, v, u\ v‘. 

If wc denote by tv, and w t the two values of w for a given Xj there subsists 
between these values the relation 

( a} > 2 -« 1 + «-V“*.+ , i-i‘ =B0 - 

This relation may be obtained directly from (16). The transformations from x to tv, 
and tv, are analytic. 

Let t, and t, be the variables r corresponding to tv, and tv, , derived by the 
method given in § 2. In accordance with our previous notation it is natural to write 

w t = u, -f- }/ — iv,, tv, = u M -f- Y — 1 v, 

and also _ _ 

dwJdT' = u\ -f V— 1t/,, dwjdx t = u\ + )f— 1 v',. 

These transformations from x to u» ( and tv, lead us of course to the equations 
(20), (21) in which u, v, r arc replaced by v,, r and m,, v,, t, respectively. 
The relation between t, and t, is furnished by the equations 


But we have identically 
so that 

and thus we find 

(24) 

where 


* = iftor*. = irwr'v 

f(w,)dw t = f(w_)dw„ 

_/w 
I ~ \ fm ‘ ’ 

w[ = d wjdx lt w\ = d wjd t, . 
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The equations (23) and (24) give explicitly the relation between the two sets 
v, t u\y v\ and 11 1 , v t , u', v\ which correspond to the same state of motion. 

It is evident that these sets arc distinct as long as u.\ and w t arc distinct. Now 
tv, and iv t only coincide at p or — 1 -f p. If tt-, approaches w t in this way we 
obtain 


lim 




dw. 


from (23). Hence we infer that 




*4 = lim -t—-* = — 


dw 


w\ -f- w\ = o. 

If w[ and w\ were equal they would reduce to zero for w t = w t = p or — 1 -f- p, 
in contradiction to (22). This establishes that the two sets arc distinct in all eases. 

We note in passing that the analytic transformation from w t to xu t is of the 
type considered in § 2. It may be verified without difficulty that this transformation 
takes the equations (20) into themselves. At the same time it may be observed that 
the non-ar.alytic transformation which replaces m, v, t by u, — t», — t also leaves 
the equations invariant in form. 


Part II. 

ANALYSIS SITUS OF THE MANIFOLDS OF STATES OF MOTION. 


S6. 


Geometric Representation. 


We arc now in a position to form a satisfactory geometric representation of the 
totality of motions given by (1), (3). Conceive of m, v, u\ v' as the rectangular 
coordinates of a four dimensional space, and consider the three-dimensional manifold or 
manifolds (22) lying in that space. These manifolds comprise all of the points repre¬ 
sentative of possible states of motion, and no others. 

The three-dimensional manifolds of slates of motion are without singularity unless C 
has one of five particular values. 

In fact, since the left-hand member of (22) is analytic for any conceivable set of 
such values, it follows that the equations for a singularity are 

(25) «* = 0, v' = o, ir(n, v) = o, a;(«, v) = o. 

The first two equations imply that U(x, y)—\ C is zero at the corresponding point (x, y), 
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since th^ function Q*(«, v) docs not vanish at 5 or / and can vanish elsewhere only 
when 12 (x, y)—;C vanishes. The last two equations imply that 12 (x, y)=Q > (x, 

These last equations may be written [sec (2)] 

(«, 0. *—=* 

(26) / ,_u u\ 

(*,(*, y)^y(>- -7j -7r) = °- 

It is verified without difficulty that there arc five solutions (x, y): 

;p,^(x,,0), P, = (x„ o), P,=(x„o) 

(27) /P. = (-T + P. » / I). P, = (-T + IS-T^) ”)• 

Thus there is a singularity of such a manifold only when C has one of the five 
values necessary to make Q(x, y) — jC vanish at P., P,, P,. P, or P,* At lhcsc 
values, for the corresponding u, v and for u' = v' = o, two of the manifolds (22) 
are about to unite or separate. We shall exclude these values of C from consideration, 
so that in all remaining cases the manifold is non-singular and any small part of it 
may be represented in ordinary space. 

To each point of a manifold of states of motion there trill correspond one and only 
one curve of motion. This will pass through the point with a definite direction which is 
an analytic function of position in the space. 

In fact, we have from (20) 

dii _dv _ _ du' ___ dV _ 

M ' v ' 7 rielf ?* 1 + °!( m » v ) -+ *?(«• v ) 

Our statement is certainly valid unless all four denominators can vanish simultaneously. 
In this case four equations, essentially the same as the four equations for a singularity 
of the manifold, arc obtained. These have been proved inconsistent unless C has one 
of the five excluded values. 

The totality of states of motion may thus be thought of as given by the steady 
motion of a three-dimensional flow occupying a non-singular manifold in four-dimen¬ 
sional space. As r changes, each point moves along a non-singular stream line which 
represents the orbit in the x, y-planc. 

This motion leaves unaltered the volume integral 

r r rdud v (u'dv‘ - -sdn) 

(28) JJJ ») 

extended over a part or the whole of the three-dimensional manifold (22). For, it we 
write <1* = arc tan d vd n and employ (22), it is clear that the above integral becomes 


j* JJ' dud vd if. 


IO ) Cf. C. L. Charlier, Die Mfcbjnik Jet II,mauls (Leipzig. Veil, 1907), vol. II. pp. 102-117. 
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Now our equations of motion (20) may be written [compare with (5')] 
d ~ = Yaar(», «.)cos+ = U{u, v, +), 
dv 


0°) {dT = v ) sin + = K (“» v » H 

4. - tQsinl + u; (m, t;) cos ^ _ , 

^ ^ >; + ^u-(«, v) _ ’ ’ n 

But we have 

^(«» V, *) + V.(H> V, +) + **<“» V, t) _ O, 
so that the integral (29) docs remain invariant. 


$7- 


On the Regions of Motion. 


It is of primary importance to determine the nature of the three-dimensional man¬ 
ifolds of states of motion. Prior to doing this we need to consider the nature of 
the regions 12 (x, y) — 7 C ^ o in the x, y-plane, and of the corresponding regions 
o in the u, v-plane in which, by (3) and the last equation (20), motion 
is only possible. The essential facts concerning the regions in the x, y-plaoe are well- 
known and will only be outlined. 

We have observed [see (26) and (27)] that the simultaneous equations Q,(x, y) = o, 
U, (*» y) = o arc satisfied at five points P, , P,, P,, P % , P i of the x, y-plane. The 
first three of these lie on the axis of x on the three segments into which S and / 
divide that axis. The last two points form the remaining vertex of the two equilateral 
triangles which have S and J as vertices. Also 12 (x, y) becomes infinite at S, at /, 
and at infinity, and is everywhere positive. 

Consider the family of algebraic curves 12(x, y) — -J-C = o which form the boun¬ 
daries of the regions under consideration. For each pair of points (x, y), (x, — y) we 
get one and only one value of C. Therefore if C be made to decrease from -}- 00 
to zero, each member of the family of curves generated is symmetric with respect to 
the x-axis, and one of the curves passes through each point of the x, y-plane. 

For C large and positive we obtain obviously two small approximately circular 
ovals about S and /, and a large approximately circular oval containing 5 and /. For 
such values of the constant C the motion takes place cither within ouc of the smaller 
ovals (Case I) or outside of the large oval containing S and / (Case II), inasmuch 
as 12(x, y) - jC must be positive by (3). 

As C diminishes, the small ovals expand and the large ouc contracts until the 
two small ovals unite at P t along the axis. 

In fact, it can be established without difficulty that at P 4 and P s the function 

On. Pst„~. 1. XXXIX ('O Mm. 191 j). - St.^p.to II 9 m. w io 191 {. , 
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li(x, y) has its absolute minimum. It can also be established that the absolute minimum 
of 12 (x, y) within the segment 5 / of the x-axis exceeds the value of 12 (x, y ) at the 
same distance from S or J but outside of this segment. Consequently 12 (x, y) has a 
greater value at P, than at P , P„ P , P, f and the two smaller ovals will unite at 
P as stated. 

Within the double oval about 5 and /, so formed, motion may take place (Case III), 

At the next stage the large outer oval and the double oval unite at P, (or P,) 
into a horse-shoe shaped oval cutting the x-axis twice to the right of J (or the left 
of 5 ). Motion may take place outside of this oval (Case IV). 

If C be decreased further, this oval breaks up into two symmetrical ones on 
opposite sides of the x-axis by passage through P, (or P,), and motion takes place out¬ 
side of these two ovals (Case V). 

These two ovals diminish to the points P 4 , P , and vanish, as C decreases further 
to 3. For C < 3 motion is possible throughout the plane (Case VI). 

These are the well-known facts which we shall need to use "). 

Let us now inquire into the nature of these regions of motion as represented in 
the m, v-plane. The transformation from the x, v-planc to the k, v-plane is given by 
(16). This equation shows that a suitable representation of the transformation maybe 
obtained by taking a Riemann surface of two leaves in the x, y-plane with branch 

points at 5 and /, and making one value of w correspond to each point of this sur¬ 

face. 

It may be observed at once that the ovals in the 11, v-plane arc symmetric with 

respect to the real axis, inasmuch as the ovals in the x, y planc arc symmetric with 

respect to the axis of reals in that plane, and conjugate values of * yield conjugate 
pairs of values of w. 

In Case I the region of motion including J may be deformed into a small oval 
about / in this Riemann surface without passing over the branch points 5 and /. 
Hence in the m, v-planc the region of possible motion will be the interior of an oval 
about w — (A. 

In Case II the region of motion may be deformed (under the same restriction) 
into the exterior of two superimposed large ovals about ; = c© including 5 and /, 
one in each leaf of the Riemann surface. These regions correspond to the interior ol an 
oval about xu = - — p and to the exterior of an oval about w = 00 in the », v-plane. 
The motion may be represented in either region. 

In Case III the region of motion may be deformed into the interior of two 
large superimposed ovals about ^ = oo in the Riemann surface. In the m, v-plane the 
region of motion will accordingly consist of the ring lying between an oval about 
tt< = — -j- >x and an oval about to = co. The points w = — 1 -j- y. and to = ft 

lie one on each of the two segments of the «-axis within the ring. 


*•) Loc. cit. ,0 )- 
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In Case IV wc have two superimposed ovals in the Riemanv surface, each of 
which can be deformed to a point. Hence in the m, r-planc there will be two ovals. 
Since, at the two intersections of the oval with the x-axis in the .v, y-planc, the ratio 
U +1-.“)/U-i*) is real and positive, each of the ovals in the «, v-plane intersects 
the real axis twice. The fraction (tt* -j- I — p-)/(s«* — p) is positive for one oval and 
negative for the other; the first oval cuts the Maxis to the left of w = — i -f- y., or 
to the right of u/ = p, and the second oval cuts the M-axis between te = — I -f-p. a *>d 
xc = u. on the same side of w = — ] -f- p. The region of possible motion is the 
region exterior to these ovals. 

In Case V the two ovals in the x, y-planc will yield four ovals superimposed in 
pairs in the Rikmann surface, all of which can be deformed to a point. The region 
of motion in the 11, v-plane will therefore be the part of the plane exterior to four 
ovals. Inasmuch as the ovals in the x, y-planc do not cut the real axis, the ovals in 
the /i, v-plane will not cut the real axis, and the four ovals are two-by-two symme¬ 
trically placed with respect to that axis. 

In Case VI motion is possible throughout the w, v-plane. 


$8 

A Problem in Analysis Situs. 

We arc now in a position to consider the nature of the three-dimensional mani¬ 
folds ol states of motion, and wc begin with Case I. The point (m, v) is restricted to 
lie in an oval about / in the it, v-planc. Any such pair of values it, v together with 
values it', v‘ subject to (22) yields a possible state of motion. 

But to any point («,, v,) in this oval there is a point (u t , v t ), also within it, 
such that the two associated points in the Riemaxn surface given by (16) lie one 
above the other. The corresponding values w ( and tv, of w arc related by (2j). This 
equation makes it certain either that one of the points («,, v ( ), («,, v,) lies in the 
upper half plane and the other in the lower half plane, or else that both lie along 
the real axis. 

Let us therefore confine ourselves to the upper half plane v ^ o. By doing so 
we obtain a single representation of each state of motion for v^o. A double repre¬ 
sentation of each state of motion v = o will be obtained. 

Now, given u, v^o, and u' such that \u\Z.\'2 il*(n, v), there will correspond 
two values of v’ equal in magnitude and opposite in sign such that u, v, u\ v' satisfy 
(22). These values of v' coincide and become zero for |m*| = /2U*(u, v). 

Thus the manifold in question (v ^ o) is in two-to one continuous correspon¬ 
dence with the three-dimensional manifold 

212* («, v) ^ o, v ^ o, - 12 U- [i, v) ^ It' £ \’i UX^T). 

The two values coincide along the surface |m'| = j'212*(m, v). 
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Taking into account the fact that ^2U'(h, v) is positive within the oval and 
zero along its boundary, wc see that the above w, v, //'-manifold is in one-to-one con¬ 
tinuous correspondence with the n, v, //'-manifold 

, _ X o, V ^ o, - f I — u'-* ^ Kl ^ “• - 

If «, v, m' be taken as rectangular coordinates of a point, this represents a hemisphere. 

The manifold of states of motion which we arc considering (v ^ o) and the 
points of this hemisphere, doubly taken except along the spherical surface, are in one- 
to-one continuous correspondence. The states of motion v = o will be represented by 

points of the plane surface v = o. 

It follows that the states of motion in question arc in one-to-one continuous cor¬ 
respondence with the points of a sphere. This sphere may be conceived of as formed 
from the two hemispheres by reflecting one of the hemispheres in its spherical boundary, 
leaving points of that boundary invariant, and by effecting a suitable distortion. 

The bounding surface of the sphere so obtained is compounded of two hemispher¬ 
ical parts coming from the doubly taken plane boundary of the hemisphere. I hese 
parts are in one-to-one continuous correspondence with the manifold u, v = o, /<, 
v‘ ^ o and the manifold «, v = o, u', v" Z o respectively, and have in common 
the bounding circle for which t»' = o. 

If we employ the notation of 5 5 to represent the pair of points (//, v, u‘, v ') 
giving the same state of motion, it is dear that t v t and w % arc real for v = o; also 
from (23) it appears that the ratio of w\ to w\ is real and negative in this case. 
Hence each point of one part of the bounding surface is paired with a point of the 
other part of the surface which represents the same state of motion. Along the 
common circle, a point // ( , v, = o, mJ, v' = o will be paired with a point i/ Jf v, = o, 

v\ = o of the same boundary, and no pair of points coincide in one (sec § 5). 
This property shows that the pairing along the boundary is essentially equivalent to 
that obtained by pairing opposite points of the circle and may be reduced to it by 
distortion. 

By a second distortion each point in one bounding hemispherical surface of the 
sphere may be made to lie diametrically opposite the point with which it is paired in 
the other hemispherical surface. 

In Case I, when the motion takes place in an oval of zero velocity about J, the 
states of motion are in one-to-one continuous correspondence with the points of a sphere 
on which opposite points are taken as identical. 

Let xj x\ xj = t be the equation of this sphere, where x,, x,, x, are rect¬ 
angular coordinates; if then we write 


x = 


> T-i » 


+ *: + *; 


x. = 






,/ x ; + xl + *; ’ 


the points x ( , x t , x ( and x ( , x lf x ( of the space of inversion are so paired that the 
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interior of the sphere corresponds to the exterior of the sphere, and opposite points 
on the sphere arc paired. It follows that the manifold of states of motion in Case I 
is in two-to-one continuous correspondence with the points of the space of inversion ,a ). 

In Case II it has been noted that for any point in the oval in the x, y plane 
there is but one point of the oval about w = — 7 + K in the u , v-planc. Moreover 
the point w = — 7 + |* corresponds to the singular point * = 00. By (22) the man¬ 
ifold of states of motion will be in one-to-one continuous correspondence with a 
manifold 

tt'* + V** = I — M* — V* 


from which the sets of values 11 = v = o, corresponding to w = — .1 -f- are 
excluded. But this is the surface of a hypcrsphcrc in four-dimensional space from which 
the closed line n = v = o, 11" -f- v‘‘ = 1 is excluded. Either this fact or a direct 
analysis of the n, v, u\ tr'-manifold lead' to the conclusion: 

In Case //, when the motion lakes place in the exterior of an oval including S 
and /, the slates of motion are in one-to-one continuous correspondence with the points of 
the space of inversion exclusive of a closed singular line whose neighborhood corresponds 
to the neighborhood of infinity in the x, yplane. 

By an expansion of the singular line into a closed tube and an inversion of space 
about an interior point of the tube it is possible to obtain a representation in the in¬ 
terior of an anchor ring whose boundary corresponds to the neighborhood of infinity. 

It would have been possible to obtain the results for Case I somewhat more 
simply from the equations of Levi-Civita, and for Case II from the equations (1). 

To deal with Case III we proceed much as in Case I. As in Case I we have 
two points in the three-dimensional manifold 11, v, iT, v' representative of each state 
of motion. This appears from the fact that superimposed points within the double 
oval in the Riemann surface correspond to distinct points in the single ring-shaped 
region in the m, v-planc. 

Moreover, as before, we obtain a single representation of each state of motion 
by taking v^o, with the exception of the states of motion v = o, which are doubly 
represented. For a given 11, v ^ o and u' such that |u'| ^ |^2 U*(m, v), we have two 
values of v' equal in magnitude but opposite in sign. These values merge and become 
zero for |u*| = /aU'(u, v). 

The function 12*(n, v) is positive within the ring-shaped region of motion in the 
n, v-plane, and vanishes along the two boundaries. Let u , v, v* be taken as rectang¬ 
ular coordinates of a point. If we bear in mind the condition v ^ 0, it is clear that 
the states of motion under consideration are in two-to-one continuous correspondence 


,a ) Cf. H. Poincare, loc. cit. »), p. 199. Also 3 ), p. } 8o. Professor O. Veblen has called my 
attention to the fact that the space of projttlivt geomttry also affords an equivalent representing space. 
This is not, however, the space of the coordinates. 
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with ih, half of an anchor ring whose curved cor^ponds » 

and whose bounding meridian plane corresponds to v- • 

mcrce along the curved surface. _ 

This region nuv be deformed into .he region of space between two concent 
hemispheres so tha, the spherical surfaces correspond to the two sect.ons of the nchor 

ring by the meridian plane, and the plane boundary corrcsponds .o .he curveJ ound^ 

ing surface of the anchor ring. To each point of this hemisphere sohd corresponds 
two states of motion, which merge along the bounding plane. 

Hence, if we consider this solid as doubly taken, and make a reflection of one 
of the solids about this plane leaving the points of the plane invariant, we obtain a 
representation of the states of motion in the region between two concentric spheres, 
which is one-to-one and continuous save along the two bounding spherical surfaces 
where it is two-to-one. But these two surfaces are each of the same nature as the 
similar spherical surface in Case I. Thus wc reach the following conclusion: 

In Case III, when the motion takes place in the intenor of an oval of Tcro velocity 
containing S and /, the states of motion are in one-toone continuous correspondence with 
the points between two concentric spheres on each of which opposite points are taken as 

identical. . ... 

The original manifold u, v, u\ v' represents each state of motion twice. We may 

unite two solids each bounded by a pair of concentric spheres in such wise that they 
form a like solid and give a one-to-one continuous representation of the u t v, v' 
manifold. Similarly placed points on the two bounding spherical surfaces arc to be 
taken as identical. A complete representation of the motion in ordinary space is obtained 
by joining an infinite number of such solids so as to fill all of space except the origin. 
An infinite set of points will, however, correspond to the same state of motion in 
this representation. 

With the aid of entirely similar methods the remaining cases can be discussed. 

In Case IV, when the motion takes place outside of an oval containing neither S 
or /, the states of motion are in one-to-one continuous correspondence with the points 
between two concentric spheres on each of which opposite points are taken as identical, 
provided that two oppositely placed non-intersecting closed lines on one of the spheres are 
excluded. The neighborhood of these lines corresponds to the neighborhood of infinity in 
the x, y-plane. 

The treatment of Case IV is of course based on the nature of the region of 
motion in the u, v-plane. This region is a ring from which two points w = — 7 -f {*, 
w = oo are to be excluded. The treatment can be made essentially as in Case III. 

In Case V, when the motion takes place outside of two symmetric ovals containing 
neither S nor /, the states of motion are in two-to-one continuous correspondence with the 
points of a right circular cylinder from which two smaller circular cylinders with parallel 
axes have been removed. The two states of motion merge along all the boundaries save 
the lateral boundary of the given cylinder. Along this boundary two points representing 
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slates of motion from the two different sets are taken as identical if they lie along gene¬ 
rators symmetric with respect to a fixed plane through the axis, and are at equal distances 
from opposite bases. The neighborhood of two symmetric singular generators corresponds 
to the neighbor Itood of infinity in the x, y-plane. 

I he fairly complicated geometrical representation in Case V is obtained by con¬ 
sidering the states of motion v^o. If n, v, arc taken as rectangular coordinates 
ot a point in space, we obtain at once the representation given. 

The stated correspondence in pairs of the states of motion v = o, which yield 
the generators of the given right circular cylinder, is easily verified under the following 
conventions. To each generator corresponds a single u (inclusive of the singular gen- 
crators u = — -i -f « = <*). Let y denote t h c ang i c between the «-axis and the 
direction of motion at (/#, o) in the «, i-plane. Thc ends of thc doubly taken gen¬ 
erator (v' = o) correspond to y = o, y; = r and to y = r, y ; = 2 r.. 

Now thc values of « for v = o arc paired by (25), in such wise that as one 
point («, o) goes from 5 to / along the segment SJ of thc /taxis, the other traverses 
the two outer segments Soo, and 00/; thc two points (11, o) coincide at 5 and /. 
Bv a suitable deformation corresponding paired generators may be placed symmetrically 
about a plane through thc two generators ns — 1 -f- y. and u = p. 

Paired generators yield thc same point in thc x, y-plane. In order that thc values 
of y for paired values of (n, o) give thc same state of motion, these values must 
differ by This conclusion results at once from thc fact that the transformation from 
one value ol w to thc other is analytic, and thc further fact that associated points arc 
arranged in opposite order on thc line v = o in thc //, v plane. 

Hence thc correspondence along thc generators is essentially as stated. 

hi Case T/, when the motion is unrestricted, the states of motion are in one to-one 
continuous correspondence with the points of an anchor ring on which points symmetric 
with respect to the center are taken as identical. The neighborhood of one of the two 
singular closed lines forming the intersection of the anchor ring and the central plane 
corresponds to the neighbor Itood of infinity in the x, y-plane. 

In (..isc \ I. as in all thc other cases, we consider thc states of motion as repre¬ 
sented by coordinates //, r, it* for which we have t>^o. This yields two states of 
motion for a given //, t\ //*, merging for |n'| = v). 

Since u and ;• are unrestricted except for thc condition v ^ o we obtain a two- 
to-one representation by a right circular cylinder as in Case V,'except that here no 
part ot thc cylinder is removed. Moreover thc correspondence along thc generators 
is thc same arc in Case V. 

The representation by a double cylinder so obtained is clearly equivalent to thc 
representation stated. 


The results above show that thc restricted problem of three bodies presents fund¬ 
amentally different aspects for different values of the constant of Jacobi. 
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There are other problems of analysis situs, such as a partial class,fica on of he 
possible types of periodic orbits, which can be based on the above results. But the 
conclusions to be obtained are not sufficiently definite for our P^P 0SCS ' 

ihc remainder of the present paper I shall confine attention to Case I O he 
other cases, only Case II appears not to present difficulties beyond those of Case I. 


Part III. 

INVESTIGATION OF POINCARE'S RING TRANSFORMATION. 


S 9 

Representation of the States of Motion for p - o. 

The greater simplicity in Case I is closely associated with the fact that here we 
have an approximation to the two body problem, a. least for u sufficiently small or 
for C sufficiently large, let us therefore commence with a consideration of the extremely 

instructive limiting case |* == o ,3 > ... 

The body / is of mass . and lies at the origin m the .v, jr-phne while the bod> 

5 is of mass zero and lies at (—1,0). 

The differential equations and the integral of Jacobi take the form 

0.) 

r‘ + 4-“ C . 


(30 


V* = U(r) 


where we have written r = l .V + v\ and where v denotes the relative velocity of P 
in the rotating .v, vplanc. These equations arc directly integrate. In fact the particle 
will move attracted to the tixed body / of mass unity according to the Newtonian 
law, and its relative motion in the .v, y-planc will be in a come section with focus 
at J which appears to rotate in a negative sense with unit angular velocity. 

If we restrict attention to the case C> J there will be two circles ol radii r < 1 
and r" > 1 along which the relative velocity will be zero. The particle may move 
within the smaller circle or without the larger circle. These facts come at once Irom 
the equation (32). We will confine ourselves to the first case, in which the particle 


■ 3 ) Much of the material presented in this and the following paragraph .s given in a 
form by PoixcarE : sec a ). 


different 
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necessarily moves in a rotating ellipse, whose semi-major axis and eccentricity we will 
denote by a and e respectively. 

II h measures the angle between the radius vector from the J to P and the posi¬ 
tive direction of the x-axis, the same angular coordinate relative to a fixed line through 
/ is 0 -f- /. Thus, by the law of areas for elliptic motion, we get 

(H) ' , -j i (.* + 0 = ±Yi(^7% 

since the mass of / is unity. The positive or negative sign is to be taken according 
as the motion is direct or retrograde with respect to fixed axes. At an npsidal passage 
wc have dr/dl = o. Hence the relative velocity v in the x, y-plane is rdh/dt. If now 
we note that at a lower apsis we have r = a(i — c), the above equation yields at once 

04) ’ = ±14(7-Tj—(«-*>■ 

Substituting this value in (52) and simplifying, we find 


( 35 ) 


= —tL -f- — }/a. 
2 \'a 2 


V - 

The quantity b here introduced designates the semi-minor axis taken positively or neg¬ 
atively according as the motion with reference to fixed axes is direct or retrograde. 

The relation (jj), which imposes a single condition upon the elliptic motion, 
yields us all possible types of motion for the given value of C, and no others. In fact 
by retracing steps it may readily be shown that if (35) obtains, the relation (32) holds 
true at lower apsidal passage for the given C and thus holds always for the same C. 

The function a) forming the right-hand member of (35) is negatively infinite 
for <7 = 0 and is zero for a = 1 C. Its first derivative as to a is constantly positive 
and us second derivative is constantly negative. Furthermore for a = 1 we find 
b= ,(C— i)> 1 . Hence the curve b=y(a), where <1 and b arc taken as rcctang- 



R.Hl. C.u. \Uttm. I. \X\I\ (,o _S..« P ..o U 9 n . uM „,j. 
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2 6 ____________ 

uhr coordinates of a point in the plane (fig. 2), has as asymptote the negative i-axis 
As a increases, b increases. The curve is constantly concave toward the a-axis, 

crosses the line h -- once, and the line * = « twice .,nce for a < . and once 

for a > I. We will denote the values of a < 1 for which b = - a and * - + 

b v a and a, respectively. 

The part of the curve a, ^ a ^ a, for which \b\^a comprises all points cor¬ 
responding to a possible elliptic motion which lies within the oval - = ,• and no 

others. In fact, if a < I such a motion cannot lie wholly outside of the oval r - r ; 

and if a > > it cannot lie wholly inside of the oval r = r'. 

It is therefore obvious that the totality of orbits to be taken into account cor¬ 
responds .0 the range a, ^ a ^ a, giving a segment MN of the curve where M 
and N arc the first points of intersection of the curve with b = -a and b =a 

A circular motion is characterized by the condition b = ± a and we will have 
the + sign if the motion is direct with respect to fixed axes, and the — sign if it is 
retrograde. Since the particle P lies nearer to / than S docs (r ^ r’ < i) the mean 
angular velocity of P with respect to fixed axes will exceed that of S. Consequently 
according as we have the + or - sign the motion is direct or retrograde with 
respect to moving axes also. 

The form of the curve b = P(n) shows that we have two and only two possible 
circular motions, one retrograde of radius a x and the other direct of radius a t . The 
first corresponds to the minimum value of a and the second to the maximum value 
of a. 

The fact that the segment MN may be described continuously by a point passing 
from M 10 N indicates that the possible types of motion may be arranged in a contin- 
uously varying series beginning with the direct circular and ending with the retrograde 
circular motion. The value i/C of n corresponds to an orbit of ejection which is thought 
of as continued analytically after the collision. 

From (35) we find at once 

00 i *--*+**■ 

Also if * stands for cither a, or a , we find that 

C=±2\fc+±, 

and hence that 


dC _ t 
d* ~~ ± 1 /, 


w — 

This quantity is negative for « < i. Thus we can state the following conclusion, of 
service to us later: 
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As C decreases from co to j tlx radii a, and a t {a t < a < 1) of the retrograde 
and direct circular or hits increase from o to ] and 1 respectively. 

We slull now inquire into the structure of the three-dimensional manifold of 
states of motion, which, by the results of 5 8 , is equivalent to a sphere on which 
opposite points arc taken as identical. 

A satisfactory set of variables in this connection is furnished by the semi-major axis 
a of the routing ellipse, the longitude 0 of the line of apsides of the ellipse with 
reference to the x-axis, and the mean anomaly \ of the particle reckoned in the same 
sense as the motion. 

The variable a ranges from <i ( to a t determining a unique b, and thus the figure 
ol the ellipse and the sense of the motion. The variable H determines the instantaneous 
position of the moving ellipse in the x, y-plane. 'Flic variable determines the instan¬ 
taneous position of the particle in its ellipse. 

l-vidently any a, h, i determines a unique state of motion so long as a is not 
equal to one of its extreme values a t , a t . In this case 0 and i become indeterminate. 
Conversely, any state of motion not corresponding to one of the circular orbits yields 
a unique a, 0 , where it is understood of course that 0 and arc angular variables 
of period 2“ With these coordinates collision docs not present any singularity. 

Let us exclude for the moment the states of motion along the circular orbits, and 
take a , 0, as cylindrical coordinates of a point in space. The totality of states of 

motion is then represented in a onc-toonc and continuous way upon the interior of 

the hollow cylinder <*, < a < a t , o Z If Z 2-. Opposite points on the two bases 
•' = o and ’} = 2 - represent the same state of motion. This solid is therefore of 
the same nature as an anchor ring. 

As t changes, the variables a, 0 , y change in accordance with the equations 

07) " = ° = + = *„"^ + +o 

in which a 0 , 0 # , stand lor the values of a, •!» at / = o. These relations are 
obvious from the definitions of a, 0 , and from the fact that the mean angular ve¬ 
locity n ol P fulfills the relation m’<T = 1, the mass of J being unity. 

Hence we sec that the curves representative of motion in a } t^-space are the 

_ j. 

spirals on the cylinders a = constant which have pitch — a * . 

Along the boundary of the cylinders the representation fails. To make plain what is 
happening here, wc observe that a given state of motion on the direct circular motion 
determines 0 -f y although it determines neither 0 nor y. The spirals 0 -f i = constant 
on the outer bounding cylinder correspond accordingly to the individual states of 
motion along the direct circular orbit. Likewise the spirals 0 — } = constant on the 
c r correspond to the individual states of motion along the retrograde circular 

orbit. 
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$ «°- 

The Ring Transformation for y = o. 


In the special case y = o, and also in the general case y ^ ** r * s * ria * 

problem of three bodies admits of a species of reduction, observed by Poimcar k ), by 
which the representation as a steady flow in three dimensions is replaced by a repre¬ 
sentation depending on a discrete transformation of a two dimensional ring into itsoll. 
We will treat the extremely instructive case y = o in the present paragraph. 

Let A’ be any point of the ring $ = o in the representing cylinder (5 9). ai,J ,ct 
l be the point in which the curve of motion through K first meets the ring \ = 2~. 
Corresponding points of these two congruent rings represent the same state of motion. 
Thus the transformation which takes K to L defines a transformation T of the ring 
y = o into itself, % 

(j8) «' = “< *' = • —a«i‘, 

where a, 0 , o art the coOrJinatej of K, and a\ O', ir arc the coordinates of I. 

[sec (37)). . 

To the essential properties of this transformation (which is a rotation) there cor¬ 
respond properties of the curve of motion, and vice versa. For instance, if the motion 
is periodic, the point K is taken into itself by a certain number 4 - of repetitions of T. 
If at the same time 0 diminishes by 2 lr. t we find, as the necessary and sufficient 
condition for this type of periodic motion, 

1 

ft' 1 ' = 6 — lr.ka* = H — 2 /*. 


Here we let V" denote the value of 0 after k repetitions of T. We thus obtain 

k 'l = <1 7 as the condition upon a which makes the motion periodic. Inasmuch as a 
varies between a % and a t , any k/\ is possible for which 

*7 <w< *7- 

The detailed treatment of the periodic orbits in this case offers no difficulty. We 
content ourselves with noting two salient facts. The quantity k I is the mean angular 
velocity of the particle in fixed space referred to the angular velocity of the moving 
axes as unit; k circuits of the apparently rotating ellipse are made while that ellipse 
undergoes / complete revolutions. After a single circuit of the ellipse the line of apsides 
has rc v essed through an angle 2/-/*. 

The transformation T admits of a simple geometrical interpretation in the .v, 
y-planc. To obtain this interpretation we first observe that the condition ■!» = o cor¬ 
responds to lower apsidal passage. Now at apsidal passage the direction ot motion in 
fixed space is at right angles to the radius vector to the origin. The rotation of the 
x, y-plane is also at right angles to this radius vector. Hence the direction ol relative 
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motion is at right angles to the radius vector in the x, y-pknc at apsidal passage (the 
case of zero velocity is included). The converse is also seen to hold. 

To distinguish between higher and lower apsidal passage wc note that, for r>a it 
motion at right angles to the radius vector in the x, y-plane will necessarily correspond 
to higher passage inasmuch as the distance from the origin exceeds the maximum semi- 
major axis. Similarly, for r < <*,, motion at right angles to the radius vector must 
correspond to lower passage. 

Suppose now that we consider the continuous series of states of motion y = o, 
dx/dt = o in the x, y-plane. Here we let x increase from a t to r‘ with dyjdI > o, then 
decrease from r' to zero with dy/dt < o, and finally increase from zero to a t with 
dy/d / > o. It is clear that the scries of states of motion thus obtained is represented by 
a continuous line in the 11, r, u', v'-spacc of $ 6 or any equivalent space. For, at 
x =r r* wc have dy/dt = o as well as dx/dt = o, while at x = o wc have H = o, 

<J« = o; hence the apparently distinct scries merge at the extreme limits for x. 

As the orbit passes through this scries of forms, the states of motion at first 
corresponds to a higher apsis. This will not change to a lower apsis until the state of 
motion belongs to one of the two circular motions. This only happens after a has 
decreased from r' to a t . This part of the scries corresponds to higher apsidal passage. 

Likewise, if wc commence with the other end of the scries, wc are led to the 
conclusion that as x diminishes from a ( to zero and increases to a t we have lower 
apsidal passage. 

In other words, if wc imagine two scries of auxiliary circles drawn with J as 
center in the x, y-planc, the first of retrograde sense and of radius less than a t , the 
second of direct sense and of radius less than a l% the states of motion corresponding 
to lower apsidal passage $=o may be characterized as those for which the path be¬ 
comes positively tangent to a member of this double scries of circles. 

It is necessary to make the convention that the null circle r = o is a member 
of both families and is tangent to any orbit of collision through it in the direction 
perpendicular to the axis of the cusp at collision. To such a direction corresponds an 
ideal point of the null circle. 

When this convention is made, a complete representation of the states of motion 
^ = o will be given by the points of this double series of circles which is obviously 
of the nature of a ring. The ring may be regarded as given by a species of surface 
of two leaves in the x, y-planc. One of these leaves extends to the circle a = a it the 
other to the circle a = a t ; and the two leaves are joined together at the origin along 
a single infinitude of ideal points as specified above. 

Accordingly this ring is in one-to-one continuous correspondence with the ring 
^ = 0 in the a, 0 , •} space. The explicit nature of the correspondence is obvious also; 
for, the lines 0 = constant correspond to the radial lines making an angle with the 
x-axis. If a varies from a t to a t while 6 is fixed, the point in the x, y-plane moves 
from the circle of radius a t to the origin and out to the circle of radius a i along the 
appropriate radial line; the distance from the origin for a given a is a — fV — b\ 
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Now let K be a point of the ring, represented in the x, y-plane, and consider 
the orbit in that plane positively tangent to an auxiliary circle through the point 
= o). The associated point L into which K goes by the transformation T is 
then the first subsequent point of positive tangency of the same orbit with one ol 
these circles = 2 r.). The specific form (j8) of the transformation shows that the 
successive points of positive tangency lie on one ’and the same circle r = constant, 
and are displaced through equal negative angles about the origin. 

Thus our results may be given in the following form: 

For y = o and C > 3 there exists a retrograde (ircular orbit of radius a t and a 
direct circular orbit of radius a i (a i <C a , <C 0 “bout J as center in the x, y plane. If we 
construct a family of concentric retrograde circles of radius less than a t and a similar 
series of direct circles of radius less than a t , these circles may be regarded as forming 
a ring of two leaves joined together at the origin in the x, y-plane , each leaf being 
given by one family of circles. For any point K of a circle of the ring we have a posit¬ 
ively tangent orbit which will again become positively tangent at a point L for the first 
lime. In this way a one-to-one continuous transformation of the ring into itself, talcing 
any point K into the corresponding point L, is obtained which leaves radial distances 

from J unchanged and regresses each point by a central angle 2-a‘ about J, where a 
is the semi-major axis of the ellipse oj motion of the particle. 

Let us visualize these results in the representing sphere of states of motion (§ 8) 
by associating with each point of the ring the point of the sphere which represents 
the state of motion of the positively tangent orbit at the point. 

To the direct and circular orbits correspond two stream lines traversing the 
sphere and joining opposite points of its surface. In fact, in the u, v-planc (§5) this 
orbit makes a complete cycle of J before re entering. During half of this cycle we 
have v^o and during the other half we have v Z.o. In the representing sphere of 
§ 7, therefore, the beginning and end of the first half cycle (which corresponds to 
a complete cycle in the x , y-planc) will not be represented by a dosed line in the 
ordinary sense but must be represented as stated. 

The intermediate circles correspond to a family of similar lines varying contin¬ 
uously from the first to the second line and forming a ring. 

iVo stream line is anywhere tangent to the ring. For, employing the coordinates 
*> y> ? (§ 0 f° r the or hit f° r the intermediate circles, we conclude that the 
equation of the ring is 

x cos 9 -f- y sin 9 = o. 

T hese coordinates fail for x — y = o and we exclude this case for the moment. If 
d x , dy, d 9 are the differentials of x, y, 9 along the orbit, the condition for tangency is 

cos 9 dx + sin 9 dy — xsin 9 dy -f y cos 9^9 = o. 

But, if we let s stand for the variable arc along the orbit and recall that the orbit 
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is positively tangent to the circle, we get from the preceding equation, 

£? __i__ ± i 

d s x sin 9 — y cos 9 r 

That is, the curvature of the orbit and circle coincide. This is impossible, for from 
this fact alone it would follow that the auxiliary circle itself represents a circular 
orbit for the given C. 

If we employ the regularizing transformation of Levi-Civita (§ 4) in order to 
consider the case x = y = o, the family of auxiliary circles with J as center is carried 
over into itself, while the orbits near J become approximately rectilinear. The curvature 
of the small auxiliary circles is large. Hence every orbit passing near J is positively 
tangent to an auxiliary circle once and only once. The stream lines must all cut the 
ring in the same sense for x and y small. 

The points which at t = o lie on the ring form a surface, which will traverse 
the whole of the sphere as / increases and each point moves along its stream line. In 
this way, when t has increased by a suitable amount, any point of the ring will be 
taken into another. The transformation of the ring so obtained is clearly the same as 
that given above. 

When account is taken of the fact that the circular periodic orbits vary analytically 
with p, considerations of continuity render the existence of a similar ring for p.^o 
highly probable. To preserve the essentials of the above construction it is only neces¬ 
sary to vary the ring in the representing space with ja so that it remains bounded by 
the two varied periodic orbits for the given C, and docs not become tangent to any 
of the stream lines. It is apparent that if such a ring is given it may be subjected to 
deformation. 

These considerations of continuity guided PoiNCAtfc to a large extent in his analytic 
construction of a similar ring. 

It may be stated in passing that an infinitude of essentially different rings can be 
formed. One of these, which has a particular importance, is introduced later (§ 19). 


ii. 


Analytic Continuation of the Circular Orbits. 


We shall establish that the simple and entirely explicit construction for the ring 
in the plane of the motion which has been obtained above in the case |a = o admits 
of complete extension to the case n ^ 0. 

The circles concentric with the origin in the x, y-plane which have been employed 
in this construction have also appeared as the totality of circular orbits for all greater 
values of C (§ 9). In generalizing the ring construction we shall make use of some 
properties of the analytic continuation of these orbits developed in the present and 
following paragraph. 
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We shall first demonstrate the existence of such a continuation by a method of 
more geometrical cast than a similar method developed by F. R. Moulton '♦). 

Consider the orbits belonging to the given value of C which cross the x-axis 
perpendicularly in the direct sense (dy/dt>o) at / = o, with abscissa nearly equal to 
a . Let the abscissa of crossing be denoted by x. The orbit will coincide with the 
circular orbit for x = a, and [i = o; in this case the first following crossing will lie 
a, x — _ a t and will be at right angles to the x-axis. If | (x) denotes the value of 
dx/di at this second crossing it is clear that 4 *(x) is analytic in x and {/.. This is an 
immediate corollary of the fact that, when two analytic curves intersecting at an angle 
different from zero depend analytically upon parameters, the coordinates of the point 
of intersection, the slopes of the curves at the same point, etc. vary analyticaUy with 
these parameters. The parameter along the curves is assumed to be regular. 

Hence we may write <l(x) as a power scries in x — a t and p, without constant 
term: 

♦M=*,(* - a ,) +p.f*+ y*( x - °y h— • 

If a, is not zero, that is if t/tydx^o for f* = o, the curve ^ = <J»(x) crosses the 
x-axis in the x, ^-plane at an angle different from zero for {* = o. In this event it is 
apparent that the abscissa of the point where the curve ; = y( x ) cu,s lhc xaxis va * 
ries analytically from — a, as u varies from zero. But the equation of this abscissa is 
v},(x) = o. This condition insures precisely that the orbit which intersects the x axis 
perpendicularly at x = a t in the x, y-plane, intersects perpendicularly at the first fol¬ 
lowing crossing. In this way we obtain the analytic and only continuation of the 
upper half of the circular orbit into another which intersects the x-axis perpendicularly 
twice in succession. 

The differential equations (i) and the integral (3) arc unchanged if x, v, t are 
replaced by x, — y, — t respectively. Therefore one concludes that this continuation 
together with its reflection in the x-axis constitutes a complete symmetric periodic or¬ 
bit, the analytical continuation of the direct circular orbit. 

It remains only to consider the condition *,^o which enters essentially into 
the above reasoning. To this end we consider the function t|/(x) for |* = o, which 
by definition denotes the component of the velocity in the direction of the positive 
x-axis in the rotating x, y-planc at the second crossing. Inasmuch as a change in the 
velocity of rotation about the origin at / docs not affect this component, y(.v) may 


1 «) F. R. Moulton. A Class 0/ Periodic Solutions of the Problem of Three Bodies with Application 
to the Lunar Theory (Transtctions of the American Mathematical Society, yol. VII (1906). pp. j > 7 * 5771 - 
The possibility of continuation was first established by H. PoincarE. Les methodes nonvelies de la Mi- 
cunique Celeste (Paris, Gauthicr Villars, 1892). vol. I. pp. 79-119. and somewhat later in a paper by 
T. LEVI-CiVITA, Sopra alcuni crileri di instability |Annali di Matematica, >er. Ill, vol. \ (1900). 
pp. 22I-J07J, in particular pp. 282-289. 
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be interpreted as the radial velocity toward / in the fixed plane at the time t t of the 
second crossing. 

The particle P in the fixed plane moves in an ellipse with focus at /, whose prin¬ 
cipal axis at the time of first crossing coincides with the x-axis since the particle crosses 
in both the fixed and moving plane at right angles. Let w denote the angle between 
the radius vector from / to the moving particle and the principal axis of the ellipse, 
taken in the same sense as the x-axis. Let e denote its eccentricity and a its semi-major 
axis. The equation of the ellipse is then 


( 39 ) 


r = «(» - o 

I i 1 cos tv 


where the -f- or — sign is required according as the first crossing corresponds to 
lower or higher apsis. From this equation and the above interpretation of «|>(x) wc get 

( 40 ) 


xrZ\ — dr \ _ T - **)sin ui dw 
d ‘ L, (liecosw ) 1 dt 


We wish to obtain dty(x)/dx for x = a t . In this case |(x) vanishes, while a 
and b reduce to a t for x = a t . Moreover wc have for any x 

«* = *, 

since the first crossing is at an apsis. If we write 

* = a a + Aa, b = a t + lb, x"=a, + Ax 
this relation becomes 

(»o -p t'(n - a b )(> *;+ Aii-f ib) = ax. 

But d b/d a is positive and greater than 1 at a = a t while A a is of course negative 
(see fig. 2). I he term Ad — A £ is therefore an infinitesimal of the first order in Ad. 
It follows from the proccding equation that Ad, A b and Ad -Ai arc of the second 
order in Ax, and that 

lim + = | im I 4 f = 


Ax 


Ax 


Dividing through the equation for •J*(x) by Ax and utilizing the fact just deduced 
wc find 


«J»(x) dw I 

*, = lint '.J- = d. sin w —! 

Ax * dt 


Here /* denotes the value of / ( at second crossing in the x, y-plane along the direct 
circular orbit 

Thus the coefficient x t vanishes if and only if the time /* required for the par¬ 
ticle in the circular orbit to pass from one crossing of the x-axis to the next is a 
multiple of the half period in the fixed plane (i. e. if w is a multiple of r). 

It wc had commenced with the retrograde circular orbit and had introduced the 

*•'*' Wl "“ «. \\\l\ o* IIUJ.- S«»m r ,.o U 1 Wffw ■ . 
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coefficient a , defined precisely as a, was defined, we should have been led to the 
same condition for the vanishing of 

Denote by n the mean angular velocity (taken positively) in the direct or retro¬ 
grade orbit relative to fixed axes. Relative to moving axes the mean angular velocity 
is then n Ip i, the upper or lower sign to be taken according as we have the direct 
or retrograde circular motion. The defining equation for f is (n + i)l? = ” and 
the condition that x t vanishes is that nt* = kr.. The condition upon n is therefore 
n __ k/(k — i). The lower sign corresponding to retrograde motion yields no pos¬ 
sible values of n which is positive by definition. 

The upper sign corresponding to direct circular motion gives us 2/1, 3/2, ... as 
the exceptional values of n. The relation n‘a\ = 1 together with the second equation 
(36) determine the corresponding values of C. We observe that the greatest value of 

n yields the least <*,, and thus the greatest exceptional value of C ($ 9). namely \')2. 

Our conclusion is that analytical continuation of the retrograde circular orbits is 
possible for any C > 3, while analytic continuation of the direct circular orbits is 
possible for any C not one of these exceptional values. 


S *2. 


Further Properties of the Direct and Retrograde Orbits. 


For our purposes it is essential to go somewhat further into the study of the 
analytic continuation of the circular orbits. We shall establish the following result: 

For C ^ C, > »'32, analytic continuation of tlx direct and retrograde circular 
orbits under consideration is possible for y sufficiently small and independent of C. The 
distortion of the x, y plane required to take the limiting circular orbits for u = o into 
their analytic continuation for the same C so that corresponding points have the same 
mean anomaly may be represented in the form 

, x Jp - = -M' —i* + f*p 7 (f. *> rt). 

) /r = e + KP - f (?, *, ,0. 

where p* f 0 are polar coordinates for y = o, and p", H’ are similar coordinates for 
v. 0, and where f and g are analytic in 0, 6, y and periodic in 0 of period 2 ~ for 
y sufficiently small. The quantities p, p' are taken positively or negatively according as 
(p\ *0* (?"» #') art point* °f a dirrfJ or retrograde orbit. 

Let us begin the proof by noting that the argument of § 11 shows that con¬ 
tinuation of the circular orbit is possible throughout any restricted range of values of 
C^C, if ft is sufficiently small. For, a ( and x, are analytic functions of y not zero 
for C in this range, and we may solve the equation i(x) =: 0 for x as an analytic 
function of a and C, for p sufficiently small, in consequence. 

In order to demonstrate the first italicized statement it suffices then to establish 
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that analytic continuation is possible for all values of C sufficiently large and for p. 
sufficiently small but independent of C. Our method will be very similar to that em¬ 
ployed in § xi. 

To do so we write X' = i iC and make the change of variables 
x — |i = V*, y = V„, / = Vt 

which reduce the equations (i), (3) to the form 

(«> S “ * v £ = + = 

where 

(44) U(5, ,wI[ r ^,,,w + 'T. + 1 ..-] + J-f , ^ 

1 re+v •'(i+vty+w 

To J in the x, y-planc corresponds the origin in the 5 , >j-plane. The transformation 
effects a translation together with a change in the units of length and time. 

The effect of changing X to — X in these equations in the same as changing r 
to — t. It follows that we may confine our attention to the continuation of the direct 
circular orbits for X positive or negative. The negative value of X for a given C will 
then yield retrograde orbits in the x, y-plane for positive X. 

Referring back to the interpretation of the case |i=o in the x, y-plane, we infer 
that, for f* = 0, the motion is in an ellipse 


( 45 ) 


=«( «- o 

1 i c cos w 


in the fixed plane. Here *, expressed in the same units as and e denote the 
semi-major axis of the ellipse and its eccentricity respectively; p denotes the radius 
vector from the particle to the origin in the same units, and w is the angle between 
this radius vector and the instantaneous position of the 5 -axis at t = o. It is obvious 
that the semi-axes a and £ of this ellipse in these units are X“* times the corresponding 
semi-axes a and b in the former units. By means of this fact we obtain the equation 


( 46 ) 



directly from (35). This is the necessary and sufficient condition that motion is pos¬ 
sible in a rotating ellipse of semi-axes r and £, for the given value of X. 

In the limiting case X = o, (C = co), ji == o the condition becomes a = 1 i. e. 
the ellipses have a unit semi-major axis but are otherwise unrestricted. There is then 
a circular orbit of radius unity. 

Now consider a series of orbits in the n-plane in which the particle is projected 
from the ;-axis perpendicularly at t = o with abscissa 5 , and let *(£) denote the 
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^•component of the velocity at the first following intersection with the 5 -axis ,s ). If 
__ o x = o, 5 = i we have x© = 0 inasmuch as the motion is in the circular 
orbit of radius i; and in the neighborhood of this set of values X© is analytic in 

„ 5 and may be written as a power series in ft, X, 5 — I. 

’ For X = o the equations (43) of motion reduce to the equations of motion for 
a particle at ( 5 , ») in a fixed 5 , n-plane attracted by a body of mass J — F at the 
origin. The series of orbits we are considering will be ellipses in this plane whose 
focus is at the origin and whose principal axis is the 5 -axis. We conclude that x (0 

is identically zero for X = o. 

Now the initial conditions at / = o are 


l = ?» * = °. 


dl = °’ 




these are analytic in 5 , p and X. Moreover in the differential equations every term 
which involves X involves X to at least the third power. Therefore if we put X = o 
in the differential equation alone and not in the initial conditions , the 5-component of 
the velocity at the time of second crossing of the x-axis will be affected only in terms 
of at least the third order in X. But this component becomes now identically zero. 
Hence x© contains no terms of less than the third order in X, and we have neces¬ 
sarily _ _ 

X© = x»[«,(5- I) + fr.P + Y,* + *,(*- 0* +•"]• 

The necessary and sufficient condition that the second crossing be at right angles 
is x( 5 ) = o. If the factor X 1 be removed, it appears that analytic continuation of the 
circular orbits into an orbit which cuts twice at right angles will be possible provided 
that a, is not zero. Also the equations (43) arc unaltered if 5 , H, * *>e replaced by 
^ - tl) _t so that orbits for which x(;) = ° y icltl symmetric periodic orbits in the 

5 , * plane. 

It is necessary then to evaluate To this end we put jx = o and write 
x T in order that the first term of the series for x(C) becomes the principal 
term. Thus we obtain 

(47) •i = K m i T» (5 = 1 +>'■ H = °> 

Since we have p = o, we find from (4;) [compare with (40)] 


( 4 «) 

where t ( is the time 


,7v _ T — » _ 

(1-tcosuO* 

ime of second crossing of the 5 -axis. But, as X approaches zero, 5 


i w t* w 


*S) It is obvious that xtf) = *'K*‘*') whet « I i$ ,he function defined ^ S "• 
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approaches i by (46), and the orbit approaches a circular orbit of radius unity; hence 
e approaches zero, w approaches r, and d w/d t approaches unity. 

Also we have by definition of l [compare with (41)] 

( 49 ) ^ « + V(A «~ A f)j 2 + A * + A pj = X \ 

where we have written a = 1 {i = 1 -f- A ^ =; 1 -f- ^ *. Now the equation 

(46) shows that A a is of the third order in X. Consequently from (49) we infer 

I 

that the radical Tf fV — {S’ or 4; at has the same principal part as X*. Combining 
these results we are able to write 


( 50 ) 


a. 



sin u» 

"SH™. 


The significance of w should be recalled. The particle P for p = o and the given 

X is projected from the point l = 1 -f-X*, n = o of the S-axis perpendicularly to that 
axis at r = o and crosses it again near to ; = — I, u = o. The angle w is the 
true anomaly relative to fixed axes at the instant of second crossing measured from 
the position of the ;-axis at r ■= o. 

If v be the mean angular velocity of the particle in the fixed plane its mean 
angular velocity relative to the moving ;, u-planc is v — X', since the £, nplanc rotates 
at angular velocity X'. Consequently, when the true anomaly is r. in the fixed plane, 
the corresponding angle is rr(i — X'/v) in the moving plane, and the difference is X'/v. 
Since this difference is of the third order in X, and since the difference remains ap¬ 
proximately unaltered through a small interval of time, it is apparent that at the time 
of the second crossing the value of xv is h(i -f- X l /v) to terms of higher order. In 
consequence the quantity sin xu has the principal part X 1 «. Returning now to (50), we 
find that a. has the value r.. 

Hence analytic continuation is possible for |X| C, *(i. e. C ^ C,) and p suf¬ 
ficiently small. It remains to demonstrate the second part of the italicized statement 
and this can be done very simply. 

According to what has been proved above we may write the equations of the 
family of periodic orbits in the form 


5 = A(\ p, r), , = B(X, ,s t), 

where A and B are analytic in X, p, t and periodic in t for |X| ^ C, * and p suf¬ 
ficiently small. For X = o the equations of motion have been noted to reduce to those 
obtained when a body of mass 1 — p at the origin attracts a particle at (£, ») in a 
fixed n-plane, in which case the family of periodic orbits above becomes circular, 
and the equations of the circular motion are 

5 = 0 — f)cos(i —p)-t, n =(i —p)sin(i — p) -, t, 


by the equations (43). 
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The variable X occurs to no lower power than the second in the differential 

equations and initial conditions. We infer that the above Rations may be written 
{= (l -riw('-Er*+E*/ 1 (».h’i ,=(.-|Osin(.-ri-T + V,,(>,!i,T) 
•Here f and g are analytic in their arguments and periodic in t. 

* for > = o the period T reduces to 2 *(t-p); the difference T-**(i-p) 
is analytic in X, p, and will contain a factor XV Now let us introduce the mean 

anomaly 


T* = 


2-7 

T 


which differs from (i - p)“* by terms of the second order in X. When t is ex¬ 
pressed in terms of t* in the equations for c, >. they retain a similar form: 

t = (, _ p)cos t* + X*/.(X, p, t*), n = (i - p) «■> «* + *'«.(». IS T ‘) 

These equations may be modified as follows: 


)'V + = i - ,4 + xy,(x, ?•), 


arc tan = t* Vgfa l*> T *)- 


Here / , g lt / , g t are analytic in all their arguments and periodic of period 2 r. in ?*. 
LcV o' denote polar coordinates in the x, y-plane with / as origin, so that 


p' = l^ , + /i , I 0 ' = arc tan y . 

The above equations may be written 

p'* = V(l - |* + >*/,(*> fc O). r = T * + V ^»C X » * T *> 

But for |i = o the values p*, 0 of p' 1 , O' reduce to 

p- = v (i + xy.o, o, t*)), o = t* + v/,(x, o, t-). 

Here p is really independent of t* since the motion is circular; and 0 is equal to v*, 
the mean anomaly. Taking note of these facts and solving for X, t‘, wc obtain 

* = p(, + p7«(p))» = 

where / is analytic in p. 

Let us associate (p, 0) with (p' t O') so that these correspond to the same value 
of X and t* (i. e. C is the same for the disturbed and circular orbits, and the mean 
anomaly is the same at associated points). If we eliminate X and 7 * by the aid of 
the last equations we get 

f ■ = f(/*—p + t’/fy, *. rt). »' = 9 + f‘r.to 9 - ^ 

But this reduces to an identity for p- = p, «' = » and p = o. We conclude that /, 
and contain a factor p. This leads us to the final form ( 42 ) of equations valid for 
C^C, and •/. sufficiently small. 

The relation of p, p' to X makes it clear that positive values of p, p' correspond 
to direct orbits, and that negative values correspond to retrograde orbits. 
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As f l varies from zero we may conceive of the family of retrograde and direct 
orbits as obtained by suitable distortions of the plane from the family of retrograde and 
direct circular orbits respectively. This fact is at once apparent from (42), which 
closely prescribes the nature of the distortions. 

S »3- 

The Ring Transformation for p ^ o. 

For a given value C, of C and for p = o we were led in § 10 to conceive of 
the families of direct and retrograde circular orbits for C ^ C, as generators of the 
two leaves of a ring. The two leaves were joined along corresponding radial lines at 
the origin. 

For |t^o we shall consider the family of auxiliary curves formed by the ana¬ 
lytic continuation of the direct and retrograde orbits for C ^ C, as constituting a 
similar ring. According to the results obtained in the preceding paragraphs this exten¬ 
sion is possible for C, > tfyi if ft be taken sufficiently small. 

The parameters defining a point (p*, 6') of the ring ($ 12) will be taken to be 
p and 0; here p" and 0' are the polar coordinates of a point with reference to the 
body /, while p‘ is the radius of the undisturbed circular orbit (p = o) with the same 
C as the orbit on which (p'*, 0 ') lies, and 0 represents the mean anomaly of the 
particle P in its orbit measured from the crossing of the x-axis. The variables p and p' 
by convention are positive or negative according as (p' 1 , 0') belongs to a direct or 
retrograde orbit. The parameters p, 0 will range over the values 

- o Z 0 Z 2*, 

where a t and a t denote the radii of the retrograde and direct orbits respectively for 
the given value C, of C. 

As a first step toward generalizing the transformation of the ring into itself we 
observe that for u sufficiently small every orbit becomes positively tangent to one of the 
auxiliary curves of the ring within a suitable interval of time t, where t is independent 
of the orbit. 

The statement holds so far as it refers to the case p = o since every rotating 
ellipse becomes positively tangent to one of the circular orbits of the ring at every 
lower apsidal passage. 

If the statement is not true there exist orbits, for p arbitrarily small, which do 
not become positively tangent to the curves of the ring in an arbitrarily long interval 
t of time. Let L be some limiting position of the particle, distinct from /, near the 
beginning of the intervals 7 for a sequence of orbits such that lim p = o, lim r = o:. 

Any limiting orbit of the sequence through L is either a rotating ellipse which 
becomes tangent to an auxiliary circular orbit at regular intervals, or else it coincides 
with one of the circular orbits which limit the ring. 
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The first case cannot arise. For, if that alternative held, a small segment of the 
limiting orbit could be selected lying within the ring for |x = o and cutting across 
the circles in opposite senses at its two extremities. A nearby segment of one of the 
scries of orbits, for some |x ^ o but small, could then be found lying within the ana¬ 
lytic continuation of the ring whose extremities cut the auxiliary curves in opposite 
senses. But this would imply an intermediate point of tangency to one of these curves. 
Such a point does not exist, by hypothesis. 

In making this argument I have purposely ignored the possibility that the limiting 
orbit is ejectional. To understand the nature of this case it is convenient to recur to 
the equations of Levi-Civita (§ 4). The conformal transformation (13) from the x, 
y.planc to the />, q -plane which lead to these equations yields transformed orbits analytic 
near the origin, while the family of auxiliary curves goes over into a family of curves 

about the origin which are approximately circular. In fact ?' = \fx‘ + / is the radius 
vector while 0'/2 is the vectorial angle in the />, f-plane, so that equations (42) yield 
this fact directly. By the phrase « approximately circular • I shall mean that by a suitable 
magnification about the origin these direct and the retrograde curves may be transformed 
into two families of similar curves which go over analytically ioto the unit circle. 

As the orbit in the p, plane is allowed to vary through the origin the radial 
lines to the point of tangency of the orbit with the auxiliary periodic orbit clearly 
comes to a definite limiting position and determines a limiting normal direction. This 
limiting position corresponds to the polar coordinates p' = 0, 0' = or 7^-f* 
in the p, q- plane. There is a unique corresponding direction in the x, y-plane given by 
the polar coordinates p" = 0, 0 ' = 0 O . The orbit through the origin thus obtained 
will be defined to be tangent at the ideal point (o, 0 # ) to tl»e null auxiliary curve. 

With this convention the possibility that the limiting orbit is an orbit of ejection 
is also to be excluded since any orbit passing near J for n small dearly becomes 
positively tangent to an auxiliary curve. 

To exclude the second case we need to make use of a fundamental property of 
the first equation (10) of normal displacement which was found in § 3. 

The coefficients in this linear differential equation of the second order in bn arc 
analytic in the parameter jx, provided that the orbit of reference varies analytically with 
|x. If then, for a particular value of ji, say n # , all of the solutions of the differential 
equations for bn vanish at least once for f 0 <*<<,, all of the solutions in will 
continue to vanish in the same interval of time for |jx — jx 0 | sufficiently small. This 
conclusion applies in particular to a periodic orbit of reference. Here the coefficients of 
the equation are necessarily periodic, having the period of the orbit. 

Owing to the interpretation of the actual normal displacement of a varied orbit 
as equal to e $ n (t small) to terms of higher order, we may infer that, if all the so¬ 
lutions of the equation of normal displacement along a periodic orbit vanish for jx = jx 0 
within some interval r, then all slightly varied orbits for |u — jxJ small will cut the 
analytic continuation of the periodic orbit at least once within the same interval t. 
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Hence, if for p = o the direct or retrograde circular orbit lead to an equation 
for normal displacement whose solutions vanish, every orbit for ja^o lying sufficiently 
near to the continuation of that orbit will cut that ^continuation arbitrarily often. Be¬ 
tween three successive points of cutting there is necessarily at least one point of pos¬ 
itive tangcncy to the family of auxiliary periodic orbits, namely a point on the seg¬ 
ment of the orbit lying within the ring. Hence the second possibility will also be ex¬ 
cluded if the equation of normal displacement for &n has the stated property. 

The precise meaning to be attached to tbe phrase «• one orbit is near another 0 
is that x, y, dx/dl , dy/dt for the two orbits differ by a small quantity throughout 
the interval under consideration. 

But the equations of normal displacement of the direct and retrograde circular 
orbits arc readily found from the equations (10), (11) and (12) to be 


( 5 «) 


d'tn , ,k . d‘6n , * 

-jji- + a , * n = o an<1 -fir H- *J« n = o 


respectively, whose familiar solutions obviously vanish as required. 

To derive the first of these equations we write first the equation of the circular 

orbit 

x (/) = a t cos 1 /, y(f) = a t sin at. 

From these equations we find 

/(o') = a,e , 

where I is the function of w defined by (12) i. e. I = %u/a t a. 

Putting w = s -f t/— 1 n we find 


and thence 


/W-v 





FO, ») = 




Substituting these values for / and F in the first equation (10), and eliminating C by 
means of the second equation (36), wc obtain the equation of normal displacement along 
the direct circular orbit as given above. The equation of normal displacement along -the 
retrograde orbit is found in precisely the same way. 

Wc have thus outlined a complete proof that for ja small any orbit has a point 
ol positive tangcncy to an auxiliary curve of the ring within some interval t of time. 
Thus there will be an infinite number of such points on each orbit. 

For a given |a, taken sufficiently small, no points of positive tangcncy on an orbit 
can coincide or disappear as tbe orbit undergoes continuous variation. This property holds 
for ;a = o since successive points of ungency arise at successive lower apsidal positions 
of the particle. 

Il ja yd : o wc proceed by first observing that it is not necessary to consider the 
immediate vicinity of /. For, the use of the variables />, q of Levi Civita (§ 4) makes 


Ci'/. MU#»>. 1. \\\l\ (|W K «. i-;i ;). — Sia-iipaiO 11 * m.jpo 191 (. 
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it geometrically clear that the nearly rectilinear orbit in the p , ?*plane is positively 
tangent to one and only one of the small approximately circular direct or retrograde 
auxiliary curves about /, and that with variation of the orbit this point of tangency 
varies continuously in the ring. 

Furthermore, if the curvature of the orbit at the point of positive tangency does 
not equal that of the auxiliary orbit, the statement holds. For if ?(x, y) denotes the 
angle which the tangent to the auxiliary curve through (x, y ) makes with the x-axis, 
the condition of positive tangency is 

dy dx . Idx dy . \ 

ro, V = Tt cos? - Tt s,n ? = o \dt cos? + Tt sin? > °) ’ 

where X denotes the parameter on which variation of the orbit depends. If dlV{t, X)/dt 
vanishes also, we should have a double point of tangency. Barring this possibility for 
the moment, we see that W(t t X) = o will have a simple root varying continuously 

with X. 

If we let 9, denote the angle between the tangent to the orbit and the x-axis the 
condition for a double point of tangency may be written 

jj[v sin(9-t.)] = o, 

where v denotes the velocity (not zero in the case that p. is small), and where 9 equals 
Vi at the instant of tangency. From this form of condition we obtain drf/d\ = d?Jdt. 
But, because of the tangency, dyjdt is equal to df/ds, multiplied by ds/dt where s, 
and i denote the arc length along the auxiliary curve and the given orbit respectively. 
Thus d?/ds and dyjds t arc equal, and the condition dlV(t % \)/dt = o is precisely 
the condition for equality of curvatures along the given orbit and the tangent auxiliary 
curve. 

We may deduce an explicit form for this condition. The curvature dy/ds at a 


point of an orbit is 


7*0 (*, y )-C + 


-»,(*.*) «■>» + “,(*.)) 
2U(X,">) - C 


by equations (s). If C' is the value of the constant of Jacobi along the auxiliary 
orbit, at a point where the curvature is the same as for the positively tangent orbit 
with the given C, the angle 9 is the same for both curves, and we obtain readily the 


desired condition 

(_ O. (X, y) sin y + a ,, (x, y) cos ? ) + ^=====) 



We shall only use this equation to note that for C = C' this condition can only 
hold if we have d^/dt = —i by (5). Now for u = o, and along the circular orbits, 
x _| -d?/dt is the angular velocity relative to a fixed plane and cannot be zero. There¬ 
fore for p. small, and for orbits near the analytic continuations of the circular orbits 


723 


THE RESTRICTED PROBLEM OF THREE BODIES. 


4} 


forming the boundaries of the ring equality of curvatures will not be possible; in fact 
under these conditions C' is nearly equal to C. 

It is easily seen further that near the two boundaries of the ring the difference 
in the curvatures of the auxiliary curve and the positively tangent orbit is of the same 
order as the difference C' — C. 

It has now been established that equality of curvatures is not possible near to J 
or the boundaries of the ring, although the difference in curvatures approaches zero 
here. Since the curvatures of the auxiliary curves and of the orbits arc continuous 
functions of x, y, dx/dt } dy/dt and p, except at /, and since the curvatures are 
unequal for p = o, it follows that the curvatures are unequal throughout the ring for 
sufficiently small values of p. 

The periodic orbits for different values of C have been computed by Hill and 
by Darwin. Their results indicate that all the conditions which have been imposed in 
the present paragraph are fulfilled for a wide range of values of p and C ( . 

Let o, 0 be the parametric coordinates of any point K of the ring defined above. 
The orbit positively tangent to the auxiliary curve at K will again be positively tangent 
at a first following point L whose parametric coordinates may be denoted by "p, 

In this way a transformation T of the ring into itself, taking each point K to L is 
obtained; for p = o the transformation takes the explicit form [sec (38)] 

P = P. (f = 0 — 2na‘ 

where p, 0 are polar coordinates, and where a is the function of p defined by the 
equation a — |V — ** = p taken in conjunction with (35). This yields the approximate 
form of the transformation for p small. 

The transformation T thus defined is evidently a precise generalisation of the trans¬ 
formation given in § 10 for the case p = o. 

It remains to give a brief treatment of the transformation T along the boundary 
of the ring. At a point of positive tangency in the neighborhood of this boundary the 
orbit under consideration lies near to the continuation of the circular orbits, and the 
normal displacement (see § 4) is approximately represented by t8n, where t is a small 
constant and An is a solution of the differential equation of normal displacement. On 
the other hand the normal displacement of an auxiliary periodic orbit from the periodic 
orbit forming the boundary of the ring is approximately of the form 9 (f) S C, where 
8 C is the difference between the value of the constant of Jacobi for the auxiliary orbit 
and for the orbit forming a boundary of the ring, and where ?(/) is not zero and 
is periodic. The approximate form of the condition for tangency of the given orbit 
with an auxiliary curve given by an auxiliary periodic orbit is that 8 C can be found 
so that 

c8m = ? (/)8C, 

It is understood of course that all terms but those of the first order are omitted in 
these equations. 
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The condition for this is precisely 


0^ = 0. 

Under the restriction that has been obtained upon the curvature of the auxiliary 
curves these will have curvature differing from that of a positively tangent orbit near 
^he boundary of the ring by terms of the same order as * C above. Thus none of the 
roots of the above equation can be multiple. 

This leads us to the conclusion that, if the orbit of the particle lies near the limit- 
. orbit of the ring and is positively tangent to a curve of the auxiliary family at 
\--t , a root of the above equation lies in the vicinity of t = t 0 . We are also led 
to the* conclusion that the first following point of tangency at t = t, occurs for / 
in the vicinity of the succeeding root of the above equation. 

Consequently, if we choose Sn proportional to that solution of the linear dif¬ 
ferential equation of normal displacement which makes the left-hand member of the 
above equation vanish at / = /„, and if t = I, is the first following root of this equa¬ 
tion, the point of the boundary of the ring corresponding to I = l 0 may be defined 
as carried into the point corresponding to 1 = 1,. By this convention the transformation 
T becomes one-to-one and continuous over the ring inclusive of the boundary. 

Every one-to-one and continuous transformation of a simple closed curve into 
itself is associated with a uniquely determined number * such that if the transforma¬ 
tion be repeated m times (m = o, ± i, ± 2, ...), every point of the curve has made 
a number of complete circuits lying between the greatest integer less than and the 
least integer greater than m x. This fact is apparent if the transformation is the distor¬ 
tion of a rotation, when * is merely the angular rotation effected by the transforma¬ 
tion. We shall term this number x, first considered by PoiNCARfe '®), the coefficient of 


rotation. 

The transformation T takes cither boundary of the ring into itself. We will let 
_ n an J — p stand for the coefficients of rotation along the direct and retrograde 

boundaries of the ring respectively. For |i = o we have p = it. a' , a = 2-a\ according 
to the results of § io. 

Our main conclusions may now be summarized as follows: 

For n sufficiently small and for each value of C ^ C, > V}2 there exists a re¬ 
trograde periodic orbit and a direct periodic orbit about J in the x, y-plane, analytic con¬ 
tinuations of the retrograde and direct circular orbits for ji = o respectively. These orbits 
may be regarded as forming a ring of two leaves in the x, y-plane joined together at the 
point f, one leaf being given by the family of retrograde periodic orbits for C^C,, and 
the other by the family of direct periodic orbits for C^C,. For any point K of a periodic 


««) H. PoiNCARfi, Sur Us courbfS definus par Us equations difftreniielles (j m< partie) fJournal de 
Mathinutiques purcs et appliquies. 4 mt t. I (188;). PP- 167 244J; in particular pp. 220-244. 
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orbit of the ring we have a unique positively tangent orbit, for the given value C, of C, 
which will again become positively tangent at a point I. for the first time. In this way 
a one-to-one continuous transformation T of tlte ring into itself, taking any point K into 
the corresponding point L, is obtained. Tlte transformation T may be so defined as to be 
continuous along the two periodic orbits for C = C, which form the boundaries of the 
ring, and varies continuously with ji from tlte transformation for |* = o obtained in § 10. 

If, as in § i o, we represent the totality of orbits by the non-singular stream lines 
of a certain non-singular three-dimensional flow, a transformation of variables may be 
interpreted as a continuous deformation of these stream lines, and does not alter the 
nature of the dynamical problem. The ring given above may also be represented in 
this space; the points of the ring will correspond to the states of motion of the orbits 
at positive tangcncy whith an auxiliary curve of the ring. This ring may also undergo 
continuous deformation, each point of it being moved along the stream line on which 
it lies. By such deformations the transformation is readily seen to be essentially unal¬ 
tered. 

From this point of view the transformation T determines the dynamical problem. 


S 14- 

Two Properties of the Transformation T. 


The fact that a volhmc integral remains invariant in the representing space (sec 
§§ 2, 6) is reflected directly in a fundamental property of the transformation T : the 

transformation T leaves a certain area integral JJ d*(p, h)dpdO invariant where 

•Kp» °) > ° ,7 )- 

This may be at once seen as follows: Consider the stream lines passing near an 
interior point K of the ring represented in the three-dimensional manifold of states of 


motion. All of the stream lines will cut the ring in one and the same sense (cf. § io) 
and will form a tube. Prolong this tube until the immediately following intersection 
of each stream line with the ring. Denote the area of one base of this tube by A A 
and the area of the other by A A. The points ( 7 , h) of the second base are obtained 
from the points (p, 0 ) of the first base by the transformation T. 

Now let t change to / -f- A t where A / is small. The tube of stream lines is trans¬ 
ferred along itself to a new position. If we let f*(p, &) denote the rate at which a 
point moves along its stream line away from the ring at (p, 0) when t changes, a 
point of cither base will be approximately F( p, b)A t units away from the ring. 

The invariant volume integrals taken over the tube between either base in its 


•7) H. I’oiMCARfi. Let method* s nouvtlUs d* ta SUcaniqut CtUsU, vol III (Paris, Gauthier-Vil- 
lars, 1899), pp. I7J-I78- 
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^ — - — - 

I _ ,1 I ( r f f XIdv (dv the clement of vol- 
first and its displaced position must be equal. If J J J ( 

time, is the invariant volume integral. these two volumes are approx.ma.ely 

MF( f , MFfa «)*'**' 

where M and M stand for the value of M ' “ » of 

spcctively. By letting the dimensions of the base of 

lime approach aero, we see that //«% '0^ «*«■ «« »“" * * *“ 

has the same value as when it U ien over the par. i into which * goes by the 
transformation T. This integral is the invariant :area m«g- of th . 

Furthermore, we may observe that F(r. O) ^ ^ forms 

sr r£vr5«», - *. 

ri " 8 l^T^JCLd .ha, a transformation T of a ring ^ssessing such an 

.-7 -gral Icf, L ^l**'**- 

aT'infinite'TmL of pcrmdic solutions in the restricted problem of three bodies for y. 

•“nShTl > second property of T which, as will be seen later, may be made 
,o yield an immediate proof .ha, there exist an infinite number ot P"'° J,C 

orbits, and upon which a classification of the symmetric orbits maybe 

The transformation T is tlx product of two transformations R, V, each of pernc , 

of which the first is a refiection in tlx X-axis. 

' 1, is the fact that the equations of motion (t), (j) are unaltered when x, y arc 

. . . _Ut.lt m the above additional property. 


It is the fact that the equations ol motion ~ " , 

replaced by x, - y, -« respectively which leads to the above addmona profxtrty. 

This is immediately seen. Let K be a point of positive tangency with one of. 
auxiliary orbits of the ring; let R be the transformation by reflection m the *ax,sw' - . 

takes the ring into itself. We may denote by L = T(/f) the fi-f-a.lowingpoto 
positive tangency (fig. i). The positively tangent orbit at L = *(/-) will then be 


the 

hich 
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point of positive ungency of an orbit first tangent again at K' = T(L') = TR T(K ). 
But a reflection brings K' into coincidence with K on account of the symmetry noted. 
That is, we have 

RTRT =/ 

where / is the transformation which leaves every point invariant. 

Hence the operation R T is of period 2 and will be denoted by U. The trans- 
iormation R and its inverse are identical so that from RT = U we get 

T=RU , (*■ = U‘ = /). 


S *5- 


On the Existence of a Transformation T. 


Suppose now that we postulate merely the existence of a retrograde periodic orbit 
M and a direct periodic orbit N about / without double points or points of intersection. 
In the representing sphere these appear as two lines joining opposite points of its 
surface (§ xo). If either of these lines be varied continuously into the other while its 
extremities remain always opposite, a ring is formed bounded by the two closed lines 
corresponding to the retrograde and direct periodic orbits. 

The ring belonging to the transformation T is precisely of this type, and is cut 
by every stream line in the same sense arvl infinitely often. 

In the present paragraph we shall obtain a necessary and sufficient condition for 
the existence of a ring belonging to a transformation T with these properties. 

Let there be given any family of auxiliary retrograde curves of continuous curv¬ 
ature contracting from M to a null curve at /, and a family of auxiliary direct curves 
expanding from this null curve to N. It will be assumed that the two families are 
without any positive tangcncy, and may be simultaneously deformed into two families 
of concentric circles about / without introducing such a point of tangcncy. Furthermore 
it will be assumed that the small auxiliary curves approach a circular form about / as 
center. 

Let a positive tangency of an orbit to one of these curves be defined as exterior 
or interior according as the orbit lies outside of or inside of the curve at the point. 
The necessary and sufficient condition that a ring bounded by M and N shall exist in 
the representing space, cut by all the stream lines in the same sense and infinitely often , 
is that along every orbit the points of exterior tangency to the auxiliary curves exceed 
those of interior tangency in a sufficiently large fixed interval of time. 

We shall begin by assuming that a family of auxiliary curves satisfying the con¬ 
ditions stated exists, and shall prove that this implies the existence of a ring transfor¬ 
mation T. 

With each point of an auxiliary curve let us associate the positively tangent state 
of motion. Then an auxiliary curve is represented by a line joining opposite points of 
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the "representing sphere, and A. family of lines -Hus obtained correspond, ro a ring 
limited by the lines which correspond to M nA N. deformed into 

The manifold of states of mo,ton for *"? on which opposite 

-r.,rr 

= 1 =. -=■ - -—■— 

in position and direction after the deformation. f „ =0 

P Moreover the ring above obtained may deformed into the ring o § V 

IS 7JT2ZZ ffSK Sl-SS-H 
=‘ £ ZZ{ ; - - v ; - -irsr£=i; 

- s = rrr 5i=t^r OTs ti* 

oiven ring of auxiliary curves is represented by a plane perpendicu 

1 - i ■■ 

isrr.»*. ~ im. rM»*.* .i« •*■*> -I™™ 

~Z d. wu. »» - p~ *“ ~r “ “” J 

*.—* ->• <-«*■«*-?• 

If the points are on the same side of the ring, the ring will necessartly be anequally 

many times in either sense by such a line. If the points are on oppostte s.des 
rinc there will be one more cutting in one sense than in the other. 

8 Consider next a segment of a stream line. If d be the distance »-«n the ,w 
limiting planes perpendicular to the axis which include tins segment betw een then , 
i, is clear that only at most d/2 :: rings congruent ,0 .he g.ven nng cut this segrnen , 
and that the excess of total crossings of the first kind over total cross,ngs of the sew 

ond kind cannot exceed rf/2 5 . c . f irtr 

But our hypothesis ensures that along any orb., the number of pomts of ester,or 

tangency to an auxiliary curve ultimately exceeds those a, inter,or '-^ncy by n 
prescribed integer. Exterior and interior tangency m the plane correspond, however, 
to the two senses of crossing of the ring by the stream ltne m our gcometrw.,1 re- 
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presentation. For, tangenev to the ring in the representing space means equality of 
curvature. This fact may be established just as a similar fact has been in the case 
(a = o (cf. § io). Also, over the direct or retrograde leaf of the ring taken 
separately, any exteriorly tangent curve may be continuously varied into any other 
through a series of such curves. To each such curve corresponds a line of the posit¬ 
ively tangent states of motion in the representing space. Hence all of these lines cross 
in one and the same sense, so far as either leaf alone is concerned. But orbits near 
to / arc always exteriorly tangent to the auxiliary curves, since the curvature of the 
approximately parabolic orbit is about one half that of the approximately circular orbit 
at tangency. Hence one sense of crossing corresponds to one kind of tangency through¬ 
out. Thus after a sufficiently long interval of time the excess of crossings of one 
kind over those of the other becomes arbitrarily large. 

When this excess has attained to d/i r, one can affirm that the terminal point of 
the stream line lies always at least d units from the plane containing the initial point 
and perpendicular to the axis. It becomes apparent in this way that as t increases from 
— oo to -f- oo all the points pass from the one end of the cylinder to the other along 
their stream lines. 

Now let us take each stream line and continue it from t = — oo until it first 
cuts the ring. Remove all half stream lines of this kind which will lie in the first of 
the two parts into which the cylinder is divided. There will be exposed a ring-shaped 
surface nude up of two kinds of regions: parts of the ring cut by a stream line in 
one and the same sense, and regions made up of segments of stream lines. 

But any point of a boundary of the first kind of region is paired with a similar 
point of a similar region, namely the second point on the segment of the stream line 
forming part of the boundary and passing through the first point. By a distortion in 
which each point is moved along its stream line and paired points arc made to coincide, 
a ring is obtained cut always by the trajectories in one and the same sense by every 
stream line. This ring has the desired properties and belongs to a transformation T. 

It is readily seen that the stated condition is necessary. Let us again represent 
the states of motion bv means of the space between two cylinders. We may assume 
that a ring exists belonging to a transformation T, and this ring may be taken as lying 
in a plane perpendicular to the axis of the cylinders. 

But any family of curves having the properties stated will yield a second ring 
representing the states of motion at positive tangency to these curves. This ring may 
be deformed into the first plane ring. The stream lines which pass from one end of 
the cylinder to the other will cross this ring and its congruent images with increasing 
excess of crossings of one kind over those of the other. Since the two kinds of cross¬ 
ings correspond to the two kinds of tangency, this family of curves will satisfy the 
required conditions. 


RnU. Ci’i. .Uaifx. Pth'mt? i. XXXIX (,« ,*1$). - S«»a.p»io U 9 m»«»o I9> S- 
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Part IV. 

PERIODIC ORBITS. 


S 16. 

Existence of Retrograde Periodic Orbits. Hill’s Case. 

A question of fundamental importance for the restricted problem of three bodies 
concerns the existence of the periodic orbits of simplest type for all values of H- and 
C This is also the problem whose solution is preliminary to the doermmat.on 
the ranee of validity of the ring transformation. Up to the present ..me these orb., 
have only been considered for sufficiently small values of P and for suffic.ently large 

ValU “ shadow pass on to establish .ha, the retrograde periodic orbits lo io fact 
X, ‘. exist if there is a closed ova, of aero velocity about : )' and then byth 

results shall consider the more difficult problem afforded by the direct orbit*. Le, 
u. begin with the limiting case of the restricted problem of three bod.es treated by 
Hill [see •)], which is especially simple. 

The equations of motion in this case may be gtven the form 

(i*) »+■$— 

(sy+®y-»^ + T- c 

In passing to this limiting case the units of length, mass and time have been »Uered 
The body J has been brought to the origin and is now of unit mass, the body S h 
moved ou, indefinitely to infinity in the direction of the negative x-ax.s.white a, he 
same time the ratio of the mass S to that of J has indefin.tely mcreased. The x, yplane 
routes with the bodies at unit angular velocity as before. 

These equations enter into the category treated in § 2, with 

Kx, y) = i. u (*. 50 = t x ’ + T - T c - 

If we apply the transformation 

x = />■-?■- J = 2p«, dl = 4 (p‘+ ?■)<'*. 

which is essentially that of LevpCivita, the principles of S 2 enable us to write the 
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transformed equations at once 

(0 - 8 <p' + »')£ = v ^' j? + + *')£ =»\<a ?)■ 


where 




04) 4) = 4 [-*-(>■ - »T + C](/ + ?■)• 

This form of equation is only necessary for our purposes in order to establish 
a simple fact, evident from physical considerations: 

If the orbit of a particle P passes through the neighborhood of J there will be one 
and only one point Q of this part of the orbit for which the distance from J is a mini¬ 
mum. The orbit will be approximately a parabola with Q as vertex and J as focus. If 
the orbit is retrograde near J, it will be of loop form and cut the axis Qf at points 
before and after Q, and near to J, at a small angle. 

The existence of a unique point Q and the approximately parabolic form of the 
orbit follow at once from the approximately rectilinear form in the p , y-plane, and the 
transformation employed. 

We shall therefore pass at once to the proof of the loop form of the retrograde 
orbits. Here we shall confine attention to the typical case in which the point Q lies 
along the X-axis. Under these circumstances the orbit cuts the x-axis at right angles. 
There will be two cases; in one case the point Q lies to the left of /, and in the 
other to the right of /. Inasmuch as the treatment is the same in both cases we shall 
consider only the first, in which the particle is projected from a point Q of the x-axis 
to the left of f in the direction of the positive y-axis. We have merely to establish that 
the particle cuts the x-axis before and after projection at a small angle, near and to 
the right of J. 

With the aid of the transformed differential equations a rigorous analytical proof 
is at once made. The transformation from the x, y-to the p , y-plane takes the positive 
x-axis into the positive p- axis, the negative x-axis into the positive y axis, and the upper 
half ot the x, y-plane into the first quadrant />, y-plane. The positive y axis appears as 
the half line p = y in this quadrant. 

Since the transformation is conformal, the orbit in the p } y-plane begins at a 
point (y o , o) near the origin on the positive y-axis with a velocity nearly 2^2 by the 
last equation (53). For y 0 = o the orbit in the />, y-plane starts from the origin in the 
direction of the positive p- axis. If this orbit lies at first below the p- axis, then, for y 0 
small and positive, the orbit will obviously cut the positive and negative p- axis near 
the origin in the p , y-plane at a small angle. The orbit will thus have the desired 
properties in the x, y-plane. 

But for y 0 = o the initial conditions are 

=0. €± = a tl = - 


ji- = - 2 5 6 /I 
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Hence q is negatively immediately before and after projection i. e. the orbit in llie 
* 9 .plane docs lie at first below the pax is. 

Thus a proof has been given under the stated circumstances. 

The same considerations make it obvious that, in the limiting cusped form, " nc i 
corresponds to collision, the particle P, as seen from /, lies to the left of the axis o 
ihe cusp before collision, and to the right of that axis after collision. ^ 

For our present purposes we shall restrict attention to the case C> 3 - In this 
case only there will be a closed oval, 

, x . + _L-C = o, 


about /, symmetric with respect to both axes, in which motion is possible and along 
whose boundary the velocity is zero. It is easily seen that this oval lies within a circle 

of radius ft about / as center; for if we have r = f \ the left-hand side of the 

above equation is necessarily negative. . , 

Let us suppose now that the particle is projected at any point A — v**» ) 
the left of the origin at right angles to the x-axis and toward the positive y-axis; et 
x, y denote the coordinates of the point at the time /, and let / = o be the time o 
projection. 

From the first equation (52) we obtain 


T,- 2y = Tf)"’ 

by integration between the instant / = o of projection and any value of I. As long 
as x is negative the integral on the right is a positive increasing function o! t. 1 here- 
fore while y^o, the velocity dx/dl will exceed a positive increasing quantity. Con¬ 
sequently a time t may be assigned in advance such that either y = o orx-o for 
some / <t. Let the corresponding point of the orbit be L\ the point I. lies on the 
negative x-axis or the positive y-axis; along K L we have dx/dt>0. 

It is clear that for K near to /, the point L lies on the y-axis, and that the di¬ 
rection of the orbit at L is never parallel to the y-axis. 

As K moves to the left along the x-axis to its extreme position, the point L va- 
ries continuously as long as no new root y = o appears along the segment K L, and 

L docs not approach the origin /. . , ,, 

But a root y = o cannot appear within KL for at the same point we should 
have dx/dt > o, dy/dt = o, d‘y/dt‘^o } which is impossible by the second equa¬ 
tion (5 2 )- , . | v „ 

In consequence the point L and the segment KL vary continuously with A, at 

least until L coincides with J. . 

As the abscissa x e of the point of projection diminishes from zero to its extreme 
negative value, the point K moves from the origin to the oval of zero veloc.ty along 
the negative x-axis. At the outset dy/Jl is positive at L, since the approximately pxtx- 
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bolic orbit cuts the y-axis at a slope approximately i. At the terminal position of K 
on the oval of zero velocity we have 



from the differential equations (52). The particle is projected at velocity zero and 
moves below the axis in this case; the arc KL has neussarily passed through J earlier. 

There is thus an intermediate position of K at which L lies at the origin for 
the first time, and the orbit has cusp form with the axis of the cusp in the second 
quadrant (x Z. o, y > o). In the nearby approximately parabolic form the axis of 
the parabola is essentially in the same direction as that of the cusp. Bearing this in 
mind we see that in any possible position of the axis the slope of the orbit at L lies 
between — 00 and — 1 approximately (fig. 4). This fact may be proved at once by 



a transformation to the variables p t q, t. Hence, previous to the passage of L through 
the origin, the arc K L cuts the y-axis with negative slope. 

That is, dy/dt at L has changed from positive to negative as K passes from the 
origin to the position for which L coincides with the origin. 

At some intermediate position we will have dy/dt = o, and KL cuts the y-axis 
as well as the x-axis perpendicularly. 

The differential equations remain unchanged if x, y, t be replaced by x, —y, —t 
or — x, y, — t respectively. Therefore K L with its successive reflections in the x- 
and y-axes constitutes a complete retrograde periodic orbit. 

Now we note that dy/dt does not vanish between K and L. For, at one of two 
successive roots of dy/dt = o, we have necessarily d‘ y/dt’ ^ o which is impossible 
by the second equation (52) when dx/dt > o. 

In the limiting case of the restricted problem of three bodies treated by Hill, there 

exists for C > 3 ’ a retrograde periodic orbit which makes a single circuit of J within 
the oval of %ero velocity about J, which is symmetric with respect to both axes and such that 
dx/dt , dy/dt do not vanish except along the intersection with the x- and y-axes respectively. 

The expression for the curvature along the orbit together with the facts just 
stated make it obvious that the oval formed by the orbit is convex. 
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^T^lcar that the nature of the preceding argument is largely independent of 
*_ 

the restriction C > } - 

S *7- 

Some Properties of the Curves u,(x, y) = ° and “,(*• >"> = °' 

In order to generalize the argument of the preceding paragraph we need to know 
the nature of the part of the curves U.(*, y) = o, U,(x, y) = oi which lies within 
the oval of zero velocity about J. For the limiting case of S < hese curves reducc 

to the y- and x-axes respectively. 

The simpler equation to consider is il,(x, y) = ° or 

There are no branches of this curve other than y = o within the oval. This is 
obvious in the case p = o. Since the oval does not pass through a point 

«.(*, >) = “,(*> y) = ° 

(see S 7). ^c equation 11, (x, y) = o is the condition that the tangent to the oval be 
parallel to the y-axis. If an additional root of this equation along an oval about J be 
introduced as the ovals vary with p, it will correspond to a point of inflection with 

vertical tangent (i. e. a tangent parallel to the y-axis). . . , 

A point of inflection cannot lie along the x-axis unless U /y (x, y) vanishes along 
that axis; and this condition implies that the coefficient of y in (55) vanishes along the 
x-axis within the oval: 


(«) 


. - ,L=Jt - 


I* — H' 


= O. 


.. 

This is not possible. In the first place if the point (*, o) lies between S and / both 
of the denominators in the left-hand member of this last equation are numerically less 
than I. The two fractions are greater than 1—p and p respectively in consequence, 
and the left-hand member is negative. To treat the possibility x > p or x < — 1 + p, 
we note that U,(x, y) may be written in the alternative forms [sec (26)] 

* a \ 


(5«) 


(x - p)(< - - 7f)+ - tt) 

(*+•-!*)(■-!*)(■- 7 f)- 


\ \ I * • ' 

The first of these two forms shows us that U.(x, y) reduces to p(i — */'’!) al a 
point of inflection along the x-axis, and is therefore positive for x > p. But 11, (x, y) 
is negative for x —p positive and small, and first vanishes at P, (see § 6). I he pos¬ 
sibility x > p, and likewise x < — 1 + p, is accordingly to be excluded. 

At a point y > o of the oval a vertical point of inflection arises also if the 
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equations 12, (x, y) = o, U ?j (x, y) = o are simultaneously satisfied. Hence the partial 
derivative of i2,(x, y)/y as 10 y W, N vanish. But this ratio is the coefficient of y in 
the left-hand member of (5 5), and the partial derivative of this coefficient as to y 
consists of two positive terms. Hence a point of inflection for y > o cannot exist. 

If now we observe that in the vicinity of / for y > o the function U ? (x, y) is 
negative, we can conclude that the function 12, (x, y) vanishes only along the line y = o 
in the oval oj %ero velocity about J, and is negative for y > o in that oval. 

Considerations of symmetry establish that Q ? (x, y) is also negative in the part 
y > o of the oval about 5 . 

The second equation to be considered is Q,(x, >) = °* an ^ ma y be written 


x _ (1 - |Q (* — fO 


f*(* + » - K) _ Q 

r\ 


If we employ the first alternative form (56) for 12, (x, y) we sec that this function 
is positive in the part of the oval about J between the line x — ji=o and the circle 
r , = 1; for, this is the region r, > i, x — n < o and the coefficient of x — jt is 
u ,(*» y)/y < o. Likewise from the second alternative form we sec that 12, (x, y) is 
negative in the part of the oval about 5 between the line x -f- 1 — n = o and the 
circle r, = 1. 

As the constant C of Jacobi diminishes from -{-00 it is clear that at first 12, (x, y) 
vanishes at two and only two points along the oval I2(x, y) — -^C = o which is ap¬ 
proximately a small circle about J as center (sec $ 7); these points will be the points 
at which the tangent to the oval is parallel to the x-axis, and arc symmetrically placed 
with respect to that axis. A single curve branch of the curve 12, (x, y) = o must there¬ 
fore pass through /, with a vertical tangent. 

It has already been noted that for x>ji and y = 0 the first root of 12, (x, y) = o 
lies at P, m(x ft o) outside the oval, and that 12, (x, y) is negative along the part of 
the x-axis in the oval. As y increases from zero, 12, (x, y) increases, since we have 

( 58 ) ,) = iiL ^y-r)7 + i f - fe ii j — £ ? ■ ? , 

Now for any positive x and for large enough y, the function U,(x, y) is positive. 
Consequently there is one and only one point on the curve 12, (x, y) = o for a 
given ia < x < x, above the x-axis. In this way we obtain a closed branch of this 
curve which will of course be symmetric with respect to the x-axis. 

Any radial line from / will cut this branch in one and only one point. Otherwise 
a radial line will be tangent to the curve at some point. This implies 

y) + (*~ y ) = o, 

(«-,)[■+^-)+?r]=0, 

which is not possible for x > u. 
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There is a closed branch of the curve Q.(x, y) = o, untbsntb. ovalof gyro■vcloa'y 
about / and lying in the strip pZxZx,, inter,ecUd by each Une parallel to the y-ax.s 
within the ttrip, in two points symmetrically placed with respect to the ■**•«•* b ' anch 
passes through J vertically bu, through no other point of the conn ‘ 

is cut by a radial line from J in one and only one point. The function Q,(*» >) 
negative to the right of this branch and positive to the left of this branch. 

Considerations of symmetry establish that there is a similar branch through 

to the left of the line x = — i + P- , N „ • lh , 

I shall now pass on to prove that this branch of the curve 12, (x, y) 

only branch within the oval of xero velocity about /. 

For x < - t + a and x > |x we have already excluded the possibility of add.- 

tional branches in the ovals about S and / beyond those already obtained. Furthermore 
wc have noted that U.(x, y) is positive for x<p, r,> . and negative tor *>-■«<+£ 
r > i. Hence we need only consider the region r, Z I, t £ 1 w lc 
ail further pans of the curve 0 .(x, y) = o which can he with,n the ova about S or A 
For p = o we have the limiting branch r, = I in this region, and for p-I we 
have the limiting branch r, = . [see (57)1- I" in<«media.e position the branch 
passes through r, = r. = . i.«. through P.. For p = * - Have a branch x = a 
Moreover for any p. we have one and only one point of a branch Q (x, y) o 
along S/, namely the point P, corresponding to the particular ft (sec § <>)> and lh,s 
point varies from S to J as j* varies from o to i. _ _ 

Now if we make the change of variables x - p = x, y = y and in this way 
transfer the origin to / for any p, the equation U.(x, y) = o becomes linear ,n p. 
Consequently each point of the region r./i.r.Ziin the x, y-planc lies on a 
branch 12 fx, y) = o for at most one value of fx. 

Neglecting certain details of demonstration^ we conclude that as p varies from o 
,o the branches of Q.(x, y) = o in the x, ,-plane form a regular pencil of curves 
through r, = r, = . varying from = . to r, = passing through each pom, of 
the region under consideration once and only once. The curves are clearly symmetric 

with respect to the ?axis and intersect it perpendicularly. 

In this way we have proved that there exists one and only one branch of U.(x, y)_ o 
in the region r, ^ r, ^ 1, and have determined its general character. It remains 
,o prove that this branch is exterior to the ovals about S and / for any value of p. 

Let us pass circles with S and J as centers through P, and le, their rad., be p. 
and P respectively. We shall establish that for any p the branch of Q,(x, y) — o is 
exterior to the circles, and tha, the ovals about S and ] lie wholly within them for 


r , r Z i, and thus complete the proof. 

‘ In the first place if the branch lies partly within one of these ovals, the one 
about / for instance, it will necessarily intersect the circle about J through P,, in 
another point & since the terminal point P lies outside of this circle. At this point 
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we have r f = p ( by definition of p,, and the condition U,( x > y) = o maybe written 

-pV)+ ■*(*+ 1 -io(‘ - 7 [) = 0 - 

Also at P, we have p — x = r, = p,, x -f i — p = r, = p, so that similarly 

-(' -K)p,(i —ji) = °' 

Comparing these equations we obtain 

it is to noted that x -f- i — p and p — x are positive at Q. Now the left-hand side 
of this equation will be increased if r t is replaced by the smaller quantity x -f- i — p. 
By doing this and at the same time replacing p, by its equal i — p a in the right-hand 
side, wc deduce the inequality 

FT«‘-py + <* + « , -ri + ,> 'ir + t + I - 

But we have also x -f- i — p > p, at Q since every point on the circle about / lies 
to the right of P t . Therefore this inequality is impossible. 

It remains to prove that the ovals of zero velocity arc interior to these circles in 
the region r x £ i, r a £ i. 

The least value C, of the constant C of Jacobi yielding an oval of the type un¬ 
der consideration is 2U(x It o) which corresponds to the limiting ovals through P t . 
The ovals to be considered lie in the region 2Q(x, y)—C„^o. This inequality may 
be written 

+i._p ; _2.) + p ( r; + 2._p : _2.)^ 0 . 

Outside of the two circles of radii p t and p ( about S and / respectively in the region 
under consideration, both r ( and r t arc increased beyond their values p ( and p t at P t . 
Both terms of the left hand member of this inequality are thereby diminished, anJ be¬ 
come negative, since r ( and r, are less than i. In consequence this inequality cannot 
hold outside of these circles. 

The proof is now complete. 


s 18. 

Existence of Retrograde Orbits. General Case. 

We are now in a position to establish the existence of a retrograde periodic orbit 
for the general restricted problem of three bodies under the hypothesis that for the 
given value of C the particle lies within a closed oval of zero velocity about /. 

Wc shall prove first: // the particle P within the oval of %pro velocity about J be 

Rt»J. C»»«. Milim. i. XXXIX (i® i*n). — Si.mpno il , mjjglo 191 f. I 
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•* * * V" ^Ttz^T ;/^=0 

Jon dx/dl > O will bold o, Ua,t unltl on, of ,I„ coxduwn, U.(x, >} 

“ “St- -HU we observe firs, that so ,ong as 0 .<x, ,) * po*« (§ - 7 ) we 
have by the first equation (i) 

- * — 2 JJ > ° • 

Irnegrating between the tinte « = o of projection a, /f. when we have dx,d, = 7 = «• 
and the time I, we find 

^-2y>°, 

so that dx/iil remains positive as long as y ^ o and U.(x, y) ^ ° 

Thus by $ .7 the orbit satisfies -he condition dx/dt > o e.ther unt.l a pom. L 
for which , -, = o, , ^ o is reached, or «n„l we have y = » x - P< °. « 
is clear that one of those alternatives actually arises, since we cannot have dx .fl^> 

permanently. For K near to / there is such a point L. . 

Lc, r (- c/a < r < ic/2) denote the angle which the direction of motion at L 

makes with the positive x-axis. As K varies from / to its limiting position on the ova 
of zero velocity, the point L will also vary in such a way that the angle r and the 
ordinate v at L vary analytically. As has been seen above, this variation can only 
terminate if cither y becomes zero at an interior point of A A, or if L coincides ^.ti 
J. But the first possibility may be excluded, inasmuch as the conditions dx <it > o, 
d‘ y/d /' \o arc incompatible with the second equation (i) by § 17. Hence the \ar- 
iation will actually terminate by coincidence of L with /, for in its limiting position 
on the left the projected particle falls at once below the x axis (cf. §16). 

If we recall that for K near to / the value of t is nearly r 4 on account ol the 
form of the orbit ($ 16), and note that in the terminal position, the axis ot the final 
cuspcd orbit is inclined at an angle 0 (*/2^0<-) to the positive x-axis, we can infer 
that the curve V representing the values of r. y in rectangular coordinates is a cont¬ 
inuous curve lying wholly inside of the rectangle — r./2 < t < «/ 2, o^y < y 0 
(y o being the ordinate of the point with abscissa x = jx on the oval of zero velocity 
about /), except for an initial point (*/ 4f o) and a terminal point (- «/ 4 - d, o) 
where o <d £ r/ 4 (fig- 5 >- 
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The equality signs have been excluded on account of the character of the retro¬ 
grade cusped orbit at /; the particle before collision lies to the left of the axis of the 
cusp as seen from /. A rigorous consideration of the nature of the orbit near collision 
is easily made precisely as in § 16, by a use of the equations of Levi-Civita, and 
leads to the same conclusions. 

We shall now pass to the consideration of the orbits terminating at a point R 
on the xaxis to the right of /, and shall prove: If the particle P be projected to a 
terminal position R on the x axis to the right of J in the direction of the negative yaxis, 
the condition dx/dt>o will hold before R, at least until one of the conditions U,(a, y) = o 
or y = o is satisfied. If K is sufficiently near to J, the conditions d x/d I > o, y > 0 
will hold until an earlier instant for which x — p becomes negative. 

Here we observe that, as long as U,(x, y) is negative (§ 17), we have by the 
first equation (1) 



Integrating between the time t and the terminal time t = o when we have dx/dt=y ==0 
we conclude that 

d x 

as before. If now we recall the approximately retrograde parabolic form of orbit for 
K near to /, with / as focus and R as vertex, the truth of the last italicized statement 
becomes manifest also. 

Let now ~L denote the first earlier point of the orbit ending at R for which x — p 
reduces to zero. The point L will certainly exist and will vary analytically with R, 
at least for K near to /. In fact it is apparent that it will vary in this way so long 
as the conditions d x/d t > o, y > o continue to hold along L K. We shall denote the 
angle which the direction of motion at Z makes with the positive x-axis by r, and 
the ordinate of L by y. Also we shall let F denote the curve formed by the set of 
values t, y when K is varied from J to the extreme position for which the conditions 
above enumerated hold. 

It will be proved that the curve f necessarily intersects the curve V (fig. 5). 

This is apparent if the extreme position of K is one for which we have a col¬ 
lision at /. For in this case by the preceding methods we may establish that r is a 
closed curve lying within the rectangle in which V lies save for an initial point 
(— */ 4 * °) anl1 a terminal point (s/4 -f- e, o) where o < e Z. »/4 (fig- 5). The 
curve r extends from a point on one side of the curve T to a point on the other side. 

We have further to consider the case for which L remains distinct from J during 
the period of variation while, however, at the terminal point of T, some point of 
LK satisfies either the condition y = o, or the condition dx/dt = o. 

The possibility y = o may be excluded, for at such a point we must have si- 
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multaneously dx/dt> o, d'y/dt'^o which is in contradiction to the second equation 

Hence we may restrict attention to the possibility dx/dt = o, y> ° al somc 
point of I R. We may exclude the point K as not one for which a further root 
dx/dt = o appears, since the first equation (i) establishes that d‘x/dl is negative at 

If this point is L, the terminal value of r is ± */ 2 , and the curve P extends 
from the point (- */ 4 , o) on one side of the curve T to a point (± r./2, y ) on the 
other side of I* (fig. *)- The curves T and P will intersect in this case also. 

It is, however, impossible for the conditions dx/dt = o, y > o to be satisfied 
first at an interior point M of L K. If this impossibility be established it is clear that 

P and P always intersect. 

In order to establish it we make a division of cases according as dy/dt > o or 
dy/dt Zo at M, the first of which I shall now consider. 

Let us multiply the first equation (i) by y, the second by x —p and subtract the 
two resulting equations, member for member, from each other. We shall thus obtain 

y Tt - 2(x - ^T, - *■>37 = ■*(' - ■%)’■ 

If r x has the same meaning as before, and if 9 , is the angle which the line / P makes 
with the x axis, this equation may be written 

(59) = 

At L on the line x — p = o it is clear that dH t /dt < o, while at M it is clear 
that d 0 ( /d t > o. There is thus a point N on L M for which db i /dt = o, d‘ 9 ,/dt'^o. 
Also from the equation x — p = r, cos 9 , we infer that the condition 


37 - “ r - sin 


«/e, , dr k . 

■ 7( "*■ 37 cos ° 


holds at such a point, from which we have accordingly dr f /dt^o. 

But the right-hand member of (59) is positive at N t while the left-hand member 
reduces to —r\d *Jdt' — 2r % drjdl which is negative or zero at N, so that a con¬ 
tradiction has been reached in this case. 

In the case where we have dx/dt = o, y> o and dy/dt£0 at an interior point 
M of LK we make use of the conditions d‘x/dt‘ = o, d l x/dt l ^o which must 
clearly obtain since the condition d x/d t ^ o holds along L K. From the differential 
equations (1) we infer 

7 F = 1 (- *37 + “.(*• >)) + “..(*» >>37 + “-»<*• y ) d } l - 

At M we find therefore that d l x/dt 1 reduces to 

2Q + 
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Since 12,(x, y) is negative for y > o and since dy/dt is negative or zero, it follows 
that 12, t (x, y) must be negative if d l x/dl* is to be positive or zero. This is not pos¬ 
sible for from (58) 12„(x, y) is positive at Af. 

Thus the curves I' and i‘ obtained will necessarily intersect at one or more points. 

If the particle P passes through the corresponding point Q of the line x = p. 
with the proper value of t, its orbit before the point is reached constitutes an arc 
A L , and its orbit after the point is reached constitutes an arc L R. These two arcs 
together form a retrograde half oval about / along which d x/dt is positive, intersecting 
the x-axis perpendicularly at K and A, and becoming parallel to the x-axis only once; 
in fact if there were more than one point dy/dt =0, the conditions d‘y/dt‘ X o, 
dx.dt > o would hold at one of them, which is impossible by the first equation (1). 

Since the differential equations of motion arc unchanged if x, y, / be replaced by 
x ’ “ 1 respectively, we conclude that the arc K K thus obtained, together with 

its reflection in the x-axis, constitutes a retrograde symmetric periodic orbit. 

lor any value of C for which tltere is a closed oval of zero velocity about /, there 
will exist at least one periodic orbit making a single retrograde circuit of J, symmetric 
with respect to the x-axis, with tangents parallel to the y-axis at two points of the x-axis, 
and with tangents parallel to the x-axis at two symmetrical points. 

It is obvious that similar methods will lead to a proof of the existence of retro¬ 
grade periodic orbits in more general cases. 


S > 9 - 


An Alternative Form of Transformation. Direct Orbits. 


We shall now return briefly to the case |* = o in order to establish the fact that 
it is possible to make a reduction of Ok restricted problem of three bodies to the transfor¬ 
mation of a discoid (by a discoid I mean a two-dimensional region deformable into a 
circle) into itself. 

When (* is zero, the parameters a , 0 , ^ ($ 9) serve to represent the states of 
motion for a given C. These parameters were taken as the cylindrical coordinates of 
a point in space. The totality of states of motion was then represented by the points 
of a hollow cylinder a x Z.a ^ o ^2- where opposite points of the bases 
+ = o and ^ = 2 r: were taken as identical by convention. 

It is convenient for us to replace the variables a, 0 , ^ by 

i=±-±, e = t, + = + - e 

where a , 0, f are again taken as cylindrical coordinates. The region of representation 
is now the ordinary cylinder a Z.(a t — a)/<j, a,, under a similar con¬ 
vention. 

In the a , 0 , + representation the outer and inner cylindrical surfaces correspond 
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,0 ,1. direct and retrograde circular orbits respectively; .be spirals + ♦• - 

a „d constant on these respective surfaces correspond to .be tndmdual 

0> "Tt'The I, *, •} representation tlte surface of the outer cylinder corresponds to the 
retrograde circular orbit and the spirals a* + * = constant on that cylinder correspon 
to the individual states of motion along the orbit. The acts a — o o t tc c) in er 
corresponds 10 the direct circular orbit, and to an individual state a ong * ut or 1 

corresponds the point a = o, $ = constant. 

Here then the parametric representation is only singular along the surlace ot the 
outer cylinder which corresponds to the retrograde circular orbit. 

By (37) the orbits will have the equations 


»+*.. *=[c^+-y+■]'++• 


in these new variables. As / increases indefinitely, each representative point moves in 

a spiral up the cylinder (i. c. lim '{' = +=«)• 

A point (a , o) of the base y = o of the cylinder will thus be taken into a 

definite point ( a\ O’, 2*) of $ = 2* in such a ns ay that 




In the case p = o the restricted problem 0/ three bodies may be associated with the 
one-to-one continuous transformation of a circular discoid into itself. 

It may be noted that, if we refer the states of motion to the space of inversion 
in which the representation is twotoonc, the circular disc is replaced by a discoid 
bounded by the closed curve which represents the retrograde circular orbit. 

Now it is clearly possible to establish a one to one continuous correspondence 
between the states of motion in the case p ^ o when a closed oval ot zero velocity 
surrounds /, and the states of motion in the case p = o, in such wise that the re¬ 
trograde orbit for p ^ o of the type established to exist in the preceding paragraph 
corresponds to the retrograde circular orbit for p = o. l ; or example, if one continuously 
deforms the region of possible motion f».r p ^ o, leaving the vicinity ol the singular 
point f unaltered, so that the oval of zero velocity becomes the circle ol zero velocity 
for u = o, and so that the retrograde orbit for p 9*^ o becomes the retrograde circular 
orbit for p = o, any state of motion for u^o may be set in continuous correspon¬ 
dence with the state of motion for u = o with which it coincides in position and di¬ 
rection after deformation. 

That is, we are entitled to use the representation of the states ol motion in the 
cylinder a Z. (a, — a l )/a i a t , o ^ y ^ 2 :t, for p^o also, where the surface of the 
cylinder is to correspond to the known retrograde orbit, and the individual states of 
motion along this orbit correspond to the lines 2 f J -f- y = constant. 
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Incidentally let us apply this representation to define the term direct periodic orbit 
oj simple type. It is manifestly desirable to include all those orbits which make a single 
direct circuit of J in the x, y-plane, and also orbits obtained by continuous deformation 
of such a direct orbit. But in the case p. = o the axis a = o corresponds to the direct 
circular orbit. A direct periodic orbit (with respect to the given retrograde periodic 
orbit) of simple type may thus be defined as one which corresponds to a stream line 
joining the point (a, 0 , $) with the point (a, 0 , <J>-f 2-). In general a periodic orbit 
is represented by a line joining (a, 0 , $) with (a, 6, + 2 k n). Under these cir¬ 

cumstances the periodic orbit will be said to make k circuits of the given retrograde 
orbit. A direct periodic orbit of simple type is accordingly one which makes a single 
positive circuit of the retrograde orbit (k = 1). 

For n == o every point of the cylinder moves from one end to the other as time 
increases ( d+/dt> 6 ). This property will certainly continue to hold for p. sufficiently 
small. We sec that, as long as this is the case, it is possible to construct a discoid 
across the cylinder which is cut by every stream line in the same sense. To prove 
this we may use the method used in $ 15 for establishing the corresponding theorem 
for the transformation of a ring into itself. 

The transformation T which takes a point K of one of these discoids into a point 
of the congruent discoid displaced upward by a distance 2 -t constitutes clearly a one- 
to-one continuous transformation of a discoid into itself of the same general character 
as the transformation T of a ring-shaped region into itself, and effecting an entirely 
analogous reduction of the restricted problem of three bodies. 

It is not difficult to sec that other transformations of the variable a, 0 , can he 
used to lead us to an infinite number of transformations like T. The transformation T 
is the only one, however, whose construction depends on a retrograde periodic orbit 
alone. 

If it be recalled that a transformation of a discoid into itself always possesses an 
invariant point '*) it is clear that T possesses an invariant point. To such a point 
corresponds necessarily a direct periodic orbit of simple type. Now the transformation 
T will exist if and only if every orbit in a sufficient interval of time makes an arbi¬ 
trary number of positive circuits of the retrograde periodic orbit (cf. § 15). 

If the total number of positive circuits of the retrograde periodic orbit by an arbitrary 
orbit increases indefinitely with the time t there necessarily exists a transformation T, and 
at least one direct periodic orbit of simple type. 

As ,u increases from zero, the direct orbit which is the continuation of the direct 
circular orbit may become unstable by uniting with another direct orbit. This means 
merely lhat two invariant points of T coincide, but presents no other special feature 


'*) L. E. J. Brouwer. Cbtr eintinientigt. sUtigt Transforinatioiun von Fldcbtn in sick [Mathc- 
nutbchc Annalcn, Vol. LXIX (1910), pp. 176-180]. 
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of interest so far as f is concerned. The transformation T of Poincar£, on the con¬ 
trary, depends essentially upon the stability of both the direct and retrograde orbits, 
and will no longer exist after the direct orbit becomes unstable. 

This state of affairs seems to me to make it probable that the restricted problem 
of three bodies admit of reduction to the transformation of a discoid into itself as 
long as there is a closed oval of zero velocity about /, and that in consequence there 
always exists at least one direct periodic orbit of simple type. 


S 20. 

The Symmetric Periodic Orbits. 

We shall now restrict ourselves to the case when ja is so small that the trans¬ 
formation T exists and may be represented in the explicit form given in § IJ, although, 
in reality, it is only the existence of a transformation T in which the boundaries of 
the ring correspond to two orbits symmetric with respect to the x-axis that is essential 
for our purposes. 

We propose to make application of the concept of the ring to prove the existence 
of infinitely many symmetric periodic orbits, and at the same time to develop briefly 
their mutual relation. 

In order to treat the periodic orbits symmetric with respect to the x-axis, we 
observe first that such an orbit crosses the x-axis twice and only twice at right angles. 
For if the orbit crosses at / = /„ and / = l, successively at right angles the complete 
orbit is obtained by a reflection in the x axis and has only two crossings at right 
angles. Hence there arc at most two such crossings. 

Moreover if at / = l„ and I = I, we have symmetric crossings at one and the 
same point, it is clear that at /' = (*. + 0/2 we have one crossing at right angles; 
also if there is one crossing at right angles at l = l' which is made again at l = I 
for the first time, there will be a similar crossing at (/' -f *")/*• Hence there are pre¬ 
cisely two crossings at right angles. 

Now the crossings at right angles may be divided into four classes. The first class 
corresponds to the states of motion x > y = o of the ring; guiding ourselves by 
the analogy with the case }* = o we shall term these states of motion lower passage 
at opposition; for P is in opposition to S. The second class corresponds to crossing at 
right angles for x>j*. y = o not along the ring; this state of motion may be termed 
higher passage at opposition. The third and fourth classes (x<{*, y = o) may likewise 
be termed lower and higher passage at conjunction according as the state of motion 
belongs to the ring or not. 

We shall represent the transformation T on an ordinary ring. 1 his ring may be 
conceived of as obtained from the ring of § I $ by a continuous distortion such that 
the points in the x-axis are carried into points of a new axis and such that pairs of 


745 



IHL RESTRICTED PROBLEM Ol THREE BODIES. 


points symmetrical with respect to that axis arc carried over into pairs symmetrical 
with respect to the new axis. The transformation T may still he written R U where 
K is a reflection in the axis and U is another transformation of period 2 having es¬ 
sentially the same character as R. 



The lower crossings at right angles will be represented by segments or y* 
of the axis according as the crossing is of the first or third class (lig. 6). Here we 
s tall choose *, y on the boundary corresponding to the direct orbit, and Ji, & on the 
boundary corresponding to the retrograde orbit. 

Higher crossings of the second and fourth classes arc not represented in the ring. 
However we may associate with each such crossing the first later state of motion 
along the same orbit which lies on the ring; these form two continuous lines *' 
Y'*' ')• rwo 1,ncs *"&”» Y"*"» symmetrically placed with respect to *'(!', y'$', 

correspond to the first earlier state of motion along these orbits represented by a point 
of the ring (fig. 7). 

In the ease |* = o we arc in a position to completely determine the position of 
the curves y'*\ *'T\ y"*". Let us employ the variables n, * of $ 10. We 
first note that higher passage at opposition corresponds to 0 = *, * = r. Also cqua- 
lions (37) show us that the motion of a particle for which 0 = *, •*, = * at / = o is 
given by 

n' = a, *'=-* + *, + # = <T T / + i:. 

The first point of the ring for / > o is attained for f = 2z, and the curve *'{J' 
has the equation 



This curve lies between a{J(0 = o) and y*(e = :r); for large values of the constant 
C of Jacobi a is small and the segment *'{!’ nearly coincides with yS. Likewise the 
equation of y'S' is found to be 

Thus y'V lies between y$ and xp; for large values of C, y’S' nearly coincides with 
The curves x'js* and y'S’ vary only slightly from these positions when ja is small. 

K'*J. (in. A UUm Pth'm*. i. XXXIX (i« »*«. >«■ 0- - SiMup.io II * iyi;. 9 
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•>/ "« «•'«« «">" W T7T "5 !f.'J n o f r C VV ,0 by n, «he 

—i- r j f ° r - —« —; r/rsi 

auxiliary curves. If, however, « «n e T- RU*c th invirianl . 

- —»-«-»—" 

- S orb,-, is com- 

Plccly characterized by .he fac. .ha. ir hax .wo «. J 

point L of such a sepmen. by MM repe.i.ion of T, .he correspond,n B orb,, ,s sym 

to, tor. ..is, on infinite n.ntor u/ ric pcrtoic m* uu.b 

Ml angle crossings of assigned classes, at least if p be small. f . 

* L le, and be ,he coefficient of ro...ion along .he boundar.es of,he 
ring corresponding to the retrograde and direct periodic orbits respectively (see § «J> 

Since for „ = o we have P—f. « = *7 “ * V^T"' 

r.<,) for small p. Af.er », applications of T .he ro.a„on of b, !».*.* » nd *• «• 

St r will be represen,ed by -up and —. respectively. For „ large enough. ,»p and 
»« will differ by an arbitrarily large quantity. The mth images of ' • ! 

will wind around the ring arbitrarily often. To each crossing of these lines themselves 
bv the mth images of any one of the four lines corresponds a symmetric periodic orbit 
' I shall now proceed to develop more explicit results concerning the existence and 
distribution of the symmetric periodic orbits. In order to do so it is ^stnecessary to 
define two characteristic integers corresponding to each such orbit. It will be assuvr 
in the present paragraph that u is so small that the inequalities o<f ><<<.2 -hold; 
Under this condition no two points of a symmetric pair * , * or K , p or <» * 
or S- S" can coincide, for such a coincidence would imply ,ha, et.hcr p or e was a 
multiple of 2*; it must .be recalled <ha, a', V, f, * are derived from «", {», Y . 
respectively by an application of the transforn.a.ion T. Hence a’, f/, Y ■ * continue 
,o lie in the upper half of .he ring as a, w = o, and a", b". T \ *’ «"«*»« to he m 
its lower half for the same reason (fig. 7). . . . 

Suppose that K is a point in which a given symmetric periodic orbit is positively 
tangent to an auxiliary curve of the ring. Furthermore suppose that k is the least inte¬ 
ger such that T (A’) = K so that the orbit is positively tangent to an auxiliary curve 
of the ring k times before re-entering. Then k will be termed the first characteristic 
integer of the symmetric periodic orbit; in the case a = o, k stands tor the number 
of circuits of the rotating ellipse made before the orbit closes in the x, y-planc. 

After k applications of T the point K will have rotated about the ring a certain 
number of times /. The integer / will be termed the second characteristic integer of the 
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symmetric periodic orbit; in the case f* = o, / stands for the number of rotations of 
the moving x, y-planc during a complete circuit of the orbit in that plane. 

The quotient k/l measures the mean angular velocity of the particle P about J 
with reference to fixed axes. 

We have now to consider the possible combinations of pairs of the four classes 
of right angle crossings which may arise, and to ask in the ten possible eases what 
values ot k and / correspond to existing periodic orbits. Inasmuch as the treatment in 
all cases is essentially the same, the results will only be derived in full for the ease 
in which both crossings are of the first class (lower, at opposition). 

Suppose first that T*(K) = L where K and L are distinct points of afj corre¬ 
sponding to the two right angle crossings. Then we have 

K= r-‘(L) = [U R]'(L)= U(RU )'-[*(£)]. 

= K ' S(i) = L ' ' his yiclds oncc K = r ( / ')' X™« we luvc 
— A, as is to be expected. It is also clear that the equation T l (K) = K has 
X _ 2 k as its least positive integral solution. 

But after k repetitions of 7 the points * and £ regress from their initial positions 
by k * and kf along the inner and outer boundaries of the ring respectively. Conse¬ 
quently, if 2 I- lies between kp and ka, one necessarily infers that the line a|s in its 
final position will have some point K which regresses in an angular sense by exactly 
2 /*, <hat is hes at a point L of ap. 

The corresponding periodic orbit will have 2 k and 2 /, or equal si.bmultiplcs 
there of, ik/\ and 2 /A, for first and second characteristic integers. 

For any k and I such that 

ko < lire < ka 

then exists a symmetric periodic orbit xvith 2 k and 2 1 (or equal submnltiples thereof) 
for first and second characteristic integers respectively , and with both right angles crossing 
of ths same class. * 

The argument given above is made only in the case in which both crossings arc 
lower and at opposition, but applies to the four similar eases without any modification 

Suppose nex, ,h« T(K) = L where K lies on and L on Y S. Here we find 
at once T“(K)= K as before. But after k repetitions of 7 , a and p regress by ka 
and kp along the boundaries of the ring, and so there will be a point of «(i which 
regresses by precisely (2/ + 1):: if (2/ + 1)* lies between kp and ka. Such a point 
appears as a point L of 

Strictly speaking, the existence of a point which regresses by (2/ + 1)- in an 
angular sense has not been proved. If, however, we compare 7 "Va) and 7 * (a) 
noting that v ” 

7 -*(x) = [/? Uf (a) = [R (/]*/? (a) = *[[/*]»(«) = R 7 -»(,), 
it appears that if 7 * (a) has regressed from a by less than (2 / + 1)^ in an angular 
sense, the 7 *(a) has advanced from - by the same amount. But then the application 
of T‘ ,0 7 (a) regresses the point 7 *(a) by less than 2/+ 1 complete cycles. 
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fundamental proper., of coefficients of rotation such »•($»). "« 

l “ ve *«<(2 /+l)=, 

which is impossible. 

for any * and I such that 

*f <(*'+ 0-<** 

.i ur , exists a symmetric periodic orbit with ik and 2 1 + I f" submult,pies 

thereof) for first and second characteristic integers respectively and ti lth right angle crossings 
of the first and third, or second and fourth classes. 

The two similar cases are included in this statement. 

In .ho third place In us suppose .ha. V (K) = L »here K is a pom. of 

and L is a point °< «’>'• Hc,c «• <“« 

K = T‘(L) = [1/*]'(*•) = U r-(Rl.) = il T* -(l), 

since /?(/.) = UR(i) = T-(L). Consequently we have K=R(,K)=T- (/.), and 
, 1 ms finally r'-(K) = K. Tlu. is, at - . is the firs, characiensuc integer. 

Sufficient conditions for such a symmetric periodic orbi. w«h second characiensuc 

integer al+ 2 are <(l| + (jl + 2 ) K <4e. 

For if these conditions are satisfied, after * repetitions of T the point * regresses by 
k e which will be more than I +. complete rotations (see above). Similarly ■{ regresses 
by less than / + j rotations. The *«h image of a point K of a U will therefore surely 
fall a. a point L of O'. *> 'ha. K has regressed by (2/+ 1)* + </, (d < ")• By 
an application of T ' this point goes .0 the symmetrical point of "O ; in tins position 
K has regressed by (2/ + ')* - d N ow because of the symmetry of T and T 
noted above, a further application of T will advance the point by (2/ + 2)* + « 
more. Hence, in all, the angular regress of K after 2*- 1 applications of T is precisely 


(4/ + 2) r. 

For any k and I stick that 

*?<(»'+»)*. (2l+2)*<»e 

,h,rc exists a symmetric periodic orbit with 2* - 1 and 2 / + . (or e,..a/ tubutulliple, 
thereof) for first and second characteristic integers and with right angle crossings of tie 
first and second, or third and fourth classes. 

In precisely similar manner we may prove the following result. 

For any k and l such that 

k a < 2/it, (2/+ l)it<kc 

there exists a symmetric periodic orbit wilhik—i and 2 1 (or equal submultiples thereof) 
for first and second characteristic integers, and with right angle crossings of the first and 
fourth or second and third classes. 

The method above employed can be made to yield a variety of results concerning 
the distribution of the symmetric periodic orbits. We shall confine ourselves here to 
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noting a salient fact concerning the order in which these orbits cross the x axis at 
right angles. For convenience we shall use the four auxiliary curves a *5, y$, y*^ 

corresponding to the four types of crossing, in stating the results. 

V (*.» O and (*,i I.) art pairs of first and second characteristic integers for two 
symmetric periodic orbits belonging to points K t and K t of one of the four curves 
Y^» “'f 4 '* Y *' f or F = O, then symmetric periodic orbits with the same first and second 
characteristic integers (or equal snbmulliples thereof) continue to exist with A' ( and A', in 
the same order unless, as ;x varies from %fro to the given value, one of the rotation num¬ 
bers c, c passes through the value 27 sljk l% 2 -/,/*, respectively. 

For fi = o it has been proved that if a (the semi major axis of the rotating ellipse) 
be taken as parameter along the lines *f>, y*. y'A*. then those orbits arc periodic 

which belong to the states of motion for which a ‘ is rational, and that then the ratio 
of the characteristic numbers k/l is precisely a’ 7 (§ to). 

But these periodic orbits can also be regarded as follows: the Ath image of a 
certain point on one of the lines *£, y«$, y'.,' is taken into a certain point of 

one of these lines alter making / circuits of the ring. To this pair of points corresponds 
the symmetric periodic orbit. For y. = o the intersection of the image and curve is 


unique. 

Now let j* vary from zero. As long as the two curves continue to intersect there 
will be a periodic orbit with the given k , /. An intersection will necessarily exist until 
a point of intersection passes to the boundary of the ring, since the curves cross but 
once for fx = o. 

At p = o there will be but a single point K t and K t with the stated characteristic 
ratios. When fx varies from zero a point K , cannot vary through K, for then we should 
have *.//. =*,//.- Moreover an even number of points A,, K, will be introduced or 
disappear by coincidence, corresponding to the intersections of the curve *|i, y A, 

Y'S' and the images above obtained. Hence points in the same relative order will com 
tinue to exist under the specified circumstances. 


S 21 . 


Some Geometrical Consequences. 


In conclusion I wish to take cognizance of some obvious but curious geometrical 
consequences of the existence of the ring transformation and the fact that it possescs 
an invariant area integral (see § 14). 

All of these are corollaries of the following fact: 

If fx is taken so small that a transformation T exists, and if V be any closed curve 
(not a closed stream line) in the representing splrere (§ 7) there will necessarily exist an 
infinite number of stream lines which pass through a point of V twice. 
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’"^ThTi^f » tomato. If^Tpass along a stream line from a point K of r to 
I first point L of intersection with the ring and then allow K to de J 

1 lint L will describe either an area enclosing curse I”, or an arc of a curve mu tip 
or a single point. The las. case is excluded by the hypothesis that r » no. 

" ' strranl line. In the second case the truth of the s.a.emen. made « obv«,us. 

There remains only the 6rs. case to consider in which the image f bf T 
he ring in two or more parts. Now consider the successive image of I in r, g 
under the transformation T. Because of the existence of a positive integral area in- 

•am over the ring we can infer .ha, T may be repeated so often that some 

image of r* overlaps an Id. image of P (* > 1). It follows then tha. the (* - 0* 

image of r' overlaps I“. Such an image cannot lie wholly within or without T fo 

,ha, 8 would be in contradiction to the existence of a positive Integra area invariant 
Through each point of intersection of the boundary of V and such an image mu 

oass a stream line which intersects V twice. 

Bv this process V is broken up into smaller regions (or is transformed into itselt, 

when the statement to be proved is obviously true). Applying again the same argu¬ 
ment to these smaller regions we are led to the conclusion that there exist an infinite 
number of stream lines passing through 1’ twice. . . 

We include (by convention) the case in which a single orbit cuts I infinitely 

often. 

We have at once the following corollaries: 

a) infinitely many of the orbits pass twice through any given point of the x, 

y-plane not on the oval of zero velocity, f f 

b) infinitely many of the orbits pass twice through a point of the oval of zero 

velocity, 

c) infinitely many of the orbits pass through / twice. 

For in each case the totality of states of motion under consideration will cor¬ 
respond to a closed line in the representing sphere. 


George D. Birkmoff. 
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DYNAMICAL SYSTEMS WITH TWO DECREES OF FREEDOM 

By George D. Birkhoff 

DEPARTMENT OF MATHEMATICS. HARVARD UNIVERSITY 
<*—*•«•* by E. H. Moore. M.rch 12. 1917 

The present note contains a brief summary of a paper with the same 
title which is about to appear in the Transactions of the American Mathe¬ 
matical Society. 

The equations of motion of the dynamical system under considera¬ 
tion are taken in the variational form due to Lagrange 

dt L *' ~ di L *' “ L r " °» 

where x, y represent the two coordinates of the system and where L is 
a quadratic function of their time derivatives x',y'. By an appropriate 
change of variables the principal function L is reduced to 

L - i (** + >■'*) + + fiy' + y, 

and the equations of motion then take the simple form 

x n + X/ = y M , y" -\x’ = y y , (\ = a> - 

involving only the two arbitrary functions X, y, of x, y. In the reversible 
case, i.e., the case when linear terms are lacking in L, we have X = 0. 
The normal form just written is known in this special case, but is new 
in the general case so far as I have been able to determine. Any con¬ 
formal transformation of the variables x, y joined with the correspond¬ 
ing transformation of t leaves this normal form unaltered. 

By means of the transformation theory thus obtained it is established 
that a necessary condition for the existence of an integral linear in x', y' 
is that the curves \/y = const, form an isothermal family in the x, y-plane 
When this condition is satisfied suppose the particular transformation 
of the variables x, y to be made which takes the isothermal family into 
the family y = const.; if and only if a linear integral exists will the 
resulting functions X, y become functions of y alone; and in this case 
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the equations of motion can be directly integrated. As a second appli- 
cation of the transformation theory the equations of displacement are 


^Forteversible problems it is known that the equations of motion can 
always be interpreted as those of a particle constrained to move on a 
fixed smooth surface. In the irreversible case I have been able to show 
that *, y may be regarded as the coordinates of a particle constrained 
to move in a smooth surface which rotates uniformly about a fixed axis 
and carries with it a conservative field of force. 

Among all types of orbits the most fundamentally important are those 

which are periodic. . . ... 

The existence of periodic orbits is intuitively evident in the reversible 

case, when the orbits may be looked upon as geodesics. If proper re¬ 
strictions are imposed a closed geodesic will exist along which the arc 
length is a minimum, and this geodesic will correspond to a periodic 
orbit of minimum type. Orbits of minimum type may also be derived 
by means of an interesting criterion for the reversible case due to Whit¬ 
taker 1 first rigorously established by Signorini. 5 Notwithstanding the 
fact that the integral analogous to arc length may change sign in the 
irreversible case Whittaker stated the direct formal extension of his 
criterion for this case. I have established that such an extension is 
legitimate if X is of one sign, but not otherwise. This restriction on X 
is fulfilled in the restricted problem of three bodies. 

Unfortunately, as Poincard pointed out, only unstable periodic orbits 


can be of minimum type. 

Another method may be employed to find a large class of stable pe 
riodic orbits which I call of minimax type. This entirely new method 
may be stated in a special case as follows: There is a minimum length 
of string, constrained to lie in a given surface of genus 0, which may De 
slipped over that surface. In some intermediate position the string 
will be taut and will then coincide with a closed geodesic 4 . 

Poincare has proved that a closed geodesic exists on any convex sur¬ 


face by an entirely different method. 

A *hird method for the discovery of periodic orbits is that of analytic 
continuation. Hitherto the application of this method has been limited 
by the restriction that the variation of the parameter involved be ‘suf¬ 
ficiently small.’ This restriction turns out to be unnecessary in the 
reversible case if the orbits near any orbit always intersect it. If the 


conditions 


X>0, *y>0, A (log y) >0 


hold in the irreversible case the restriction is also unnecessary. 
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An application of periodic orbits which is fundamental lies in the 
construction of surfaces of section. The existence of a ring-shaped 
surface of this sort was noted by Poincare in the restricted problem 
of three bodies and allied dynamical problems. The results of my 
paper show that such surfaces exist in a wide variety of cases and may 
be of any genus and possess any number of boundaries. By means of 
a surface of section the dynamical problem reduces to a one-to-one 
analytic area-preserving transformation of the surface of section into 
itself. 

Periodic orbits correspond to invariant points of this transformation. 
Consequently the discovery of further periodic orbits hinges upon the 
proof of the existence of such invariant points. Two theorems con¬ 
cerning such points are proved by me. 

The first of these theorems is the following: If a surface of genus p 
admits of a one-to-one analytic transformation into itself which may 
be looked upon as obtained by a deformation then the excess of com¬ 
pletely unstable invariant points over all other types of invariant points 
is precisely 2 p — 2. 

If the transformation is merely restricted to be continuous the same 
argument shows that there exists at least one invariant point for p4=l. 
For the case p = 0 this last result constitutes a well-known theorem 
due to Brouwer. 3 

The dynamical application of the result for the analytic case is to a 
fundamental equality obtaining between the various types of periodic 
orbits. 

The second theorem is the following: If the region outside of a circle 
in the plane is transformed into itself in such a way that points on the 
circumference are advanced in one sense, so that points distant from the 
center are regressed by more than a fixed positive angle about the center, 
and so that areas are preserved, then at least two points are left in¬ 
variant by the transformation. This theorem is to be regarded as the 
extension of a theorem stated by Poincar6 and proved by me. 4 

These two related theorems are used to prove that infinitely many 
periodic orbits exist whenever a surface of section is at hand. 


•Whittaker, Analytical Dynamics, pp. 376 - 384 , Cambridge, 1904 . 

* Signorini, Palermo, Rend. Cite. Mat., 33 , 1912 , ( 187 - 193 ). 

* Brouwer, Math. Ann., Leipzig, 69 , 1910 , ( 176 - 180 ). 

4 Birkhoft, Trans. Amer. Math. Soc., New York. ~ .. 
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